WordScan and WordScan Plus 


Microsoft Windows Edition, Version 1.1 


Installation Notes for 
Scanners and Fax Cards 


Manual Addendum 


December 13, 1991 


This addendum provides information you need to install several 
newly-supported scanners with WordScan, WordScan Plus, and 
WordScan Plus/AT. 


Part Number: TP-00272-00-A 


Chinon DS-3000 (N-207)


System Support
PC/AT: Yes
PS/2 MCA: No
ADF: No


The Chinon DS-3000 is a single page overhead scanner with a
serial and parallel interface port. WordScan supports the scanner
using the parallel port together with the Chinon parallel interface
(N-207) card.


Because of the overhead style of the scanner, ambient light is used
for scanning. Recognition results may vary depending on the
lighting. If you are not getting good results scanning the sample
pages supplied with WordScan, you may want to try moving the
scanner or changing the lighting.


Installation Instructions 


1. Install the scanner according to the installation instructions
included with it.


2. Copy the file CHINON.SYS from the diskette supplied with the
scanner to the root directory of the disk from which your
computer boots.


Important: Calera has tested version 4 of the CHINON.SYS
driver. Other versions may not work. Contact Chinon to
obtain the correct version of the driver file, if necessary.


3. If there is not already one there, add the following line to your
CONFIG.SYS file:


device=chinon.sys /p /t1 /u /a 

The parameters on this line have the following meanings:

/p     Use the parallel port.
/t1    Set the transmission type to 1.
/u     Do not compress the data before sending it to the
       computer.
/a     The scanner is a Model DS-3000 (N-207).


If you do not supply the above parameters, the scanner will
use the following default parameters: Chinon Model DS-2000,
serial interface at 19,200 baud, compressed data. These
parameters will not work with WordScan.


4. Reboot your system by pressing <Ctrl> + <Alt> + <Del>.
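
As a rough sketch, steps 2 and 3 might look like this when typed at the DOS prompt. The drive letters are assumptions, not part of the original instructions: the Chinon diskette is taken to be in drive A: with CHINON.SYS in its root directory, and the computer is taken to boot from drive C:. Adjust them to match your system.

```bat
rem Step 2: copy the driver to the root directory of the boot disk.
rem (Assumes the diskette is in A: and the boot disk is C:.)
copy a:\chinon.sys c:\
rem Confirm that the driver file arrived.
dir c:\chinon.sys
rem Step 3: display CONFIG.SYS and check that it contains the line
rem     device=chinon.sys /p /t1 /u /a
type c:\config.sys
```

If the line is missing, add it with a text editor before rebooting in step 4.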


Document Feeding

Flatbed:

Portrait: Face up with the top of the page to the right

Landscape: Face up with top of text toward the back


Features Supported in WordScan

Text Scanning Resolution (dpi): 200, 300
Image Scanning Resolution (dpi): 75, 150, 200, 300,
Dither Patterns: Bayer
Gray Scale: None


Datacopy Model 730GS


System Support
PC/AT: Yes
PS/2 MCA: No
ADF: Yes



The WordScan Installation Notes for Scanners and Fax Cards
(part number TA-00052-02-A) incorrectly includes the Datacopy
730GS. Disregard that information and instead use the following
instructions.


The Datacopy 730GS is a sheetfed scanner that has reasonably
good brightness control; however, when choosing darker settings,
the page tends to become darker quickly, producing a black page
when set at or below 24%. Use the Preview window to check
image quality when adjusting the Brightness setting.


Important: When running Microsoft Windows in 386 Enhanced
mode, this scanner fails when scanning the second page of a
document. If you are using WordScan on a 386 computer, you
must ensure that Windows is running in Standard mode by
starting Windows with the command:

win /s wordscan <Enter>
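
One way to avoid retyping the Standard mode command is a small batch file. The sketch below assumes a hypothetical file name, WS.BAT, saved in a directory on your DOS path; only the win /s wordscan command itself comes from this addendum.

```bat
echo off
rem WS.BAT (hypothetical name): start Windows in Standard mode
rem and launch WordScan, as this scanner requires on 386 computers.
win /s wordscan
```

With this file in place, typing ws at the DOS prompt starts WordScan in Standard mode.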


Installation Instructions


1. Install the scanner according to the installation instructions
included with it.


2. Copy the file XISDCP30.SYS from the diskette supplied with 
your scanner to the root directory of the disk from which your 
computer boots. 


3. If there is not already one there, add the following to your
CONFIG.SYS file:


DEVICE = XISDCP30.SYS=2E8 


substituting your scanner interface card's I/O address for 2E8
if you have changed the default setting. (Note that if the
address is specified incorrectly, WordScan will not be able to
communicate with your scanner.)


4. Reboot your system by pressing <Ctrl> + <Alt> + <Del>. 


Document Feeding

Sheetfeeder:

Portrait: Face down with top of text entering scanner first

Landscape: Face down with top of text against right side of
feeder


Features Supported in WordScan

Text Scanning Resolution (dpi): 200, 240, 300, 400
Image Scanning Resolution (dpi): 200, 240, 300, 400
Dither Patterns: 4x4 Random, 4x4 Spiral,
Small Diag, Med Diag,
Large Diag, XLarge Diag
Gray Scale: None


Epson ES-300C


System Support 
PC/AT: Yes 


PS/2 MCA: Yes 
ADF: No 


The Epson ES-300C was supported by WordScan version 1.0 on 
PC/AT computers only. WordScan version 1.1 adds support for 
this scanner on PS/2 MCA systems. Refer to your Installation 
Notes for Scanners and Fax Cards for complete installation and 
page orientation information. 


Hewlett-Packard Scanjet IIC


System Support
PC/AT: Yes
PS/2 MCA: Yes
ADF: Yes


WordScan version 1.1 adds support for the Scanjet IIC. Follow the
installation and page orientation instructions for the Scanjet Plus
located in your Installation Notes for Scanners and Fax Cards.


Howtek Personal Color Scanner


System Support
PC/AT: Yes
PS/2 MCA: No
ADF: No


The Howtek Personal Color Scanner is a flatbed scanner that can
scan letter and legal size pages.


This scanner has excellent brightness control. You can generally
improve text recognition on particularly light or dark pages by
using the brightness control when scanning.


Installation Instructions


Install the scanner according to the installation instructions
included with it, then install WordScan normally. This scanner
does not require any DOS drivers to be loaded in your config.sys
file.




Operation Instructions


1. Make sure that the READY light on the front of the scanner is
on before starting WordScan. This may take up to 30 seconds
from when the scanner is first turned on.

2. The scanner doesn't have a time-out feature. This means that,
for example, if the scanner cable should become disconnected
during a scan, you must power-cycle the scanner and reboot
your computer.

3. The scanner recalibrates itself frequently. This can take up to
20 seconds.

4. Black and white scanning is done with green (rather than
white) light, so scanning color images may produce
unexpected results (the green and yellow portions of the image
may disappear).


Document Feeding

Flatbed:

Portrait: Face down with upper left corner of the page in
the back right corner of the scanner, aligned with
the white arrow

Landscape: Face down with top of text against right side of
scanner


Features Supported in WordScan

Text Scanning Resolution (dpi): 200, 240, 300
Image Scanning Resolution (dpi): 40, 48, 50, 70, 75, 80, 100,
120, 150, 200, 240, 300
Dither Patterns: None
Gray Scale: None



IBM 3119 PageScanner


System Support
PC/AT: No
PS/2 MCA: Yes
ADF: No


The IBM 3119 is a flatbed scanner. It has three brightness settings
that give you the ability to compensate somewhat for light or dark
pages. The top of this scanner moves when scanning a page. Be
sure to leave space on each side of the scanner to allow for this
movement. The 3119 can handle page images up to 11.7"
(29.7 cm) long.


Installation Instructions


1. Install the 3119 Adapter/A card and configure it using the IBM
PageScanner Option Diskette, version 1.01 or later.


2. Install the scanner according to the installation instructions
included with it, then install WordScan normally. This scanner
does not require any DOS drivers to be loaded in your
config.sys file.


Document Feeding

Flatbed:

Portrait: Face down with the top of text against the left
side of the scanner. The page should be aligned
with the left and rear sides of the scanner.

Landscape: Face down with top of text against rear of the
scanner. The page should be aligned with the left
and rear sides of the scanner.


Features Supported in WordScan

Text Scanning Resolution (dpi): 200, 240, 300
Image Scanning Resolution (dpi): 40, 48, 50, 70, 75, 80, 100,
120, 150, 200, 240, 300
Dither Patterns: Super-circle, Mesh,
Super-line
Gray Scale: None


Princeton Graphics LS-300


System Support
PC/AT: Yes
PS/2 MCA: No
ADF: Yes


The Princeton Graphics LS-300 is a sheetfed scanner that holds up
to 5 pages in its feeder.


When multiple pages are loaded into the feeder at one time, the
scanner has a tendency to skew and double-feed pages as they
are fed through. We recommend watching carefully for this
problem, as recognition will be affected if pages are skewed.


There is virtually no brightness control on this scanner. This
means that when using WordScan, you will see little difference
when changing the Setup window Brightness control. Therefore,
recognition on particularly light or dark pages cannot be
improved by changing the brightness setting. We recommend
using a photocopier to try to lighten or darken pages of this type.


Note that text or graphics in many bright yellow inks is invisible to
this scanner and will not be recognized.


Installation Instructions


Install the scanner according to the installation instructions
included with it, then install WordScan normally. This scanner
does not require any DOS drivers to be loaded in your config.sys
file.


Document Feeding

Sheet Feeder:

Portrait: Face down with top of text going into feeder first

Landscape: Face down with top of text against right side of
feeder


Features Supported in WordScan

Supported Scanning Resolutions (dpi): 75, 150, 200, 300
Dither Patterns: 4x8 Fatting
Gray Scale: None



UMAX UF-32 and UF-42


System Support
PC/AT: Yes
PS/2 MCA: Yes
ADF: Yes


The UMAX UF-32 is a black and white flatbed scanner. The UMAX
UF-42 is a grayscale flatbed scanner. Both scanners have an optional
automatic document feeder with a capacity of approximately 25
pages. Both scanners install and operate identically with
WordScan, and will scan pages up to 35 cm (14") long from
flatbed or ADF. These scanners have excellent brightness control.
You can generally improve recognition by using WordScan’s
brightness control.


Installation Instructions


Install the scanner according to the installation instructions
included with it. Your CONFIG.SYS entry should look like this:


device=uscan.sys


Copy the USCAN.SYS driver from the UMAXScan diskette to the
root directory of the disk from which your computer boots.
Finally, reboot your system by pressing <Ctrl> + <Alt> + <Del>,
then follow the normal WordScan installation procedures. Choose
UMAX when prompted to select a scanner.
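
The driver copy described above might be done as follows at the DOS prompt. This is only a sketch: it assumes the UMAXScan diskette is in drive A: with USCAN.SYS in its root directory, and that the computer boots from drive C:.

```bat
rem Copy the UMAX driver to the root directory of the boot disk.
rem (Assumes the diskette is in A: and the boot disk is C:.)
copy a:\uscan.sys c:\
rem Display CONFIG.SYS and confirm that it contains the line
rem     device=uscan.sys
type c:\config.sys
```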


Document Feeding

Flatbed:

Portrait: Face down with upper left corner of page at front
left corner of scanner, aligned with the page icon

Landscape: Face down with top of text against left edge of scanner

ADF:

Portrait: Face down with top of text going in first

Landscape: Face down with top of text against left side of feeder


Features Supported in WordScan

Text Scanning Resolution (dpi): 200, 240, 300
Image Scanning Resolution (dpi): 48, 50, 60, 75, 80, 100,
120, 200, 240, 300
Dither Patterns: 12x12 #1, 12x12 #2, 8x8 #1,
8x8 #2, 8x8 #3, 8x8 #4, 8x8 #5,
6x6 #1, 6x6 #2, 6x6 #3, 4x4 #1,
4x4 #2, 4x4 #3, 2x2 #1, 2x2 #2
Gray Scale: None


WordScan and WordScan Plus


Microsoft Windows Edition, Version 1.1 


User’s Guide 


Manual Addendum 


December 13, 1991 


This addendum provides information on changes made to 
WordScan and WordScan Plus. 


Part Number: TP-00274-00-A 


New Progress Monitor


When using WordScan and WordScan Plus for background 
processing, displaying the progress monitor slows recognition 
performance and can interfere with whatever foreground 
application you are using. WordScan and WordScan Plus version 
1.1 add the ability to disable the progress monitor. 


To disable the progress monitor, open the Options menu and 
choose Progress Monitor. 


[Figure: the Options menu, shown twice. Its choices are Text Settings..., Scanner Settings..., Pop-Up Verifier..., User Dictionary..., Style Sheet..., Preview Settings..., Defer Processing, Process Deferred Docs..., and Progress Monitor. The first view shows a check mark next to Progress Monitor; the second shows no check mark.]


This is a toggle. When a check mark appears next to Progress 
Monitor, the progress monitor is enabled. When no check mark 
appears, the progress monitor is disabled. 


When the progress monitor is disabled, the following dialog box 
appears on your screen during recognition: 


[Figure: dialog box reading "Text Only" and "Page 1: Processing"]


No progress is indicated. When processing is complete, the dialog 
box disappears. If you want to cancel processing, click the Cancel 
button. 


WordScan™ and WordScan Plus Quick Start Guide


Installing


1. Install your scanner (or your fax card) by following the
instructions included with it. Be sure it works before proceeding.


2. Install Microsoft Windows version 3.0 or higher by following
the instructions included with it. Be sure it works before
proceeding.


3. If applicable, install your WordScan Plus/AT or MCA
recognition card by following the instructions in Appendix B of the
WordScan Plus User’s Guide.


4. Insert the WordScan Plus #1 Installation Disk into your
diskette drive and type:


a:install <Enter>


[Figure: DOS screen showing the WordScan Plus INSTALL program starting after the install command is typed.]


5. Follow the instructions on your screen to complete installation of
WordScan Plus.


Quick Start:
Launching


Use either of the following methods to launch WordScan Plus:


Type:


win /s wordscan <Enter>


at the DOS prompt.*


Double-click on the WordScan Plus program group icon, then
double-click on the WordScan Plus program icon.


[Figure: Program Manager window showing the WordScan Plus program group icon, and the open WordScan Plus group showing the WordScan Plus program icon.]


* If you did not have the Installation Program modify your AUTOEXEC.BAT file, you must type wordinit
or wordexec prior to starting Windows or WordScan Plus. See the User’s Guide for more information.


Quick Start:
Automatic Recognition


To automatically recognize pages, use WordScan Plus’ Automatic mode:


1. Put one or more pages in your scanner.


2. Change settings in the Setup window, if necessary. (Set
Reading For, Page Orientation, Type Quality, Image Source, and
Brightness to meet the needs of your pages.)


3. Click Automatic to start processing.


4. Wait while WordScan Plus processes your pages. The
Progress Monitor indicates the progress of recognition.


5. Choose Save from the File menu (or proof your document as
described later).


[Figure: Setup window (Reading for, Page Orientation, Type Quality, Image Source, Start Processing) with the callout "Click Automatic to start processing", and the progress monitor reading "page 1: 36% done" with a Cancel Processing button.]


[Figure: Save As Document dialog box with the callouts "Type a document name here", "Click Save to save your finished document", "Select a directory here", and "Select a text format here".]


Quick Start:
Preview Window


To selectively recognize portions of your pages, use
WordScan Plus’ Preview mode:


1. Put one or more pages in your scanner.


2. Click Preview to scan the first page.


3. Select a tool for the type of zone you want to draw.


4. Draw zones* around the areas you want to process.


5. Click Process One to process the zones on the page and scan the
next page (if any).


6. When you are finished, click Stop Scanning or Done (depending
on your scanner).


7. Choose Save from the File menu (or proof your document as
described elsewhere).


[Figure: Setup window with the callout "Click Preview to open the Preview window and scan the first page".]


[Figure: Preview window showing a text zone and an image zone, with the callout "Click Process One to process this page". The tool callouts read: Selects existing zones; Draws text zones; Draws image zones; Draws text and image zones; Draws numeric zones; Changes order of zones*.]


[Figure: Save As Document dialog box with the callouts "Type a document name here", "Click Save to save your finished document", "Select a directory here", and "Select text and image formats here".]


* WordScan Plus allows up to 100 zones per page. WordScan allows one zone.


Quick Start:
Pop-Up Verifier*


Use the Pop-Up Verifier to verify your pages for correct recognition
prior to saving your finished document:


1. Process one or more pages (or open an existing WordScan Plus
document).


2. Choose Pop-Up Verifier from the Options menu.


3. Press <Ctrl> + <L> to find the first uncertain character or word.
An image of this character or word “pops up” and the cursor
jumps to that position.


4. Make corrections, if necessary.


5. Repeat steps 3 and 4 until you see the message:


No more markers


at the bottom of the Pop-Up Verifier window.


6. Press <Ctrl> + <N> to save the page and display the next
page for verification.


7. Repeat steps 3 through 6 until you see the message:


No more pages,
exit Pop-Up Verifier?


8. Click Yes to close the Pop-Up Verifier window.


9. Save your finished document as normal.


[Figure: Options menu with Pop-Up Verifier... chosen, and the Pop-Up Verifier window (File, Edit, Search, Character, Options menus) showing a page of recognized text.]


[Figure: Pop-Up Verifier message box reading "No more pages, exit Pop-Up Verifier?"]


[Figure: Save As Document dialog box with the callouts "Type a document name here", "Click Save to save your finished document", "Select a directory", and "Select text and image formats here".]


* The Pop-Up Verifier is available only in WordScan Plus. To learn how to use the WordScan Verifier, see
the WordScan User's Guide.


CALERA 


RECOGNITION SYSTEMS 


Calera® is a registered trademark of Calera Recognition Systems, Inc. WordScan™ is a trademark of Calera Recognition Systems, Inc. 


© 1990 Calera Recognition Systems, Inc. 

Important Notice to All Users of Calera Products


The products manufactured and distributed by Calera Recognition
Systems permit reading into your computer and electronically
storing almost any document or drawing available in printed form.
Most printed materials are copyrighted under the laws of the
United States and foreign countries. It is illegal to copy or
reproduce on disk or paper, by use of this machine or any other
means, materials for which you or your company do not own the
copyright, unless you have the permission of the copyright owner
or unless your activity is within some of the limited exceptions
to the copyright laws. Copyright infringement carries with it
serious civil and criminal penalties under the law. Do NOT use
this device to copy materials illegally. Your license to use this
product’s software also prohibits such activities.


This product is intended solely for copying and reproducing
documents which you have the legal right to copy, either because
you or your company own the copyright, or because you have the
permission of the copyright owner to do so. Many scientific and
professional journals give blanket permission to copy and
reproduce portions of their contents by transmitting a fee per
page copied, usually specified in the journal, to the Copyright
Clearance Center, 27 Congress Street, Salem, MA 01970, or to the
publisher. Consult your attorney or the copyright owner with your
specific questions.


Introduction


About This Guide


This guide gives you all the information you need to install and
use WordScan™. With WordScan, you won't have to spend
hours retyping pages just so you can use them in your word 
processing, spreadsheet, or database program. WordScan is an 
optical character recognition (OCR) system that works with 
your desktop scanner to quickly and accurately convert printed 
documents into text that you can edit with your computer. This 
guide will help you get WordScan up and running so you can 
start reading pages into your computer right away. 


How to Use This Guide


You'll find the information in this guide organized as follows:


Chapter 1, Introducing WordScan, gives you a brief overview 
of WordScan and lists the system requirements. 


Chapter 2, Installing WordScan, gives step-by-step 
instructions for installing WordScan on your computer. 


Chapter 3, Getting Started, shows you how to scan, process, 
proof, and save documents in no time at all. 


Chapter 4, The Setup Window, tells you how to change 
settings and define user dictionaries, describes basic scanning 
in Automatic mode, and explains the details of saving and 
retrieving your work. 


Chapter 5, The Preview Window, explains how to define a 
zone, change the scale of the displayed page image, and 
perform recognition on a single page or on a stack of pages. 


Chapter 6, The Verifier, shows you how to use WordScan’s 
editor to instantly find each uncertain character and word, so 
you can quickly make corrections. 


Chapter 7, Advanced Features, explains how to defer
processing (scan now/recognize later), how to use templates,
and how to suspend a document in process and return to it
later.


Chapter 8, Getting the Most from WordScan, gives you 
suggestions, advice, and hints on getting the best recognition 
possible from your documents. 


Appendixes, a glossary, and an index provide other helpful 
information. 


Manual Conventions


Here are a few notes about the typographic conventions used
in this manual:


Key Names


The names of keys that you'll find on your computer keyboard
are enclosed in angle brackets:


<Esc>, <Enter>, <Ctrl>, <Alt>


Key Combinations and Sequences


A plus sign (+) between two key names means that you must
press those keys at the same time:


<Ctrl> + <Alt> + <Del>


means press <Ctrl>, <Alt>, and <Del> simultaneously.


Typographic Conventions


[Figure: a sample manual page with callouts explaining the typographic conventions. The callouts and the sample text they point to are listed below.]


Commands you type are shown in this bold typewriter style.
Sample text: "To launch WordScan, type win wordscan <Enter>
from the DOS prompt. You also can launch WordScan by using any
of the normal Windows methods for launching programs."


Windows and menus are shown as actual pictures of the screen.
Sample: the Setup Window, with its icons and buttons labeled.


Shortcuts are indicated by a symbol in the margin.
Sample: Look for Marker, <Ctrl>+<L>


Buttons, icons, and menu choices you click are shown in bold.
Sample text: "Choose Look for Marker from the Search menu. When
you have verified the last uncertain character or word,
you'll see the following message:"


Messages you see on the screen are shown in this typewriter style.
Sample: No more markers


Key topics are separated from the main text by boxes.
Sample box, titled Markers: "Mark Suspicious Characters and Mark
Suspicious Words must be enabled (X in the boxes) before you can
use the Verifier to find those uncertain characters and words that
were marked during recognition."


Related Manuals


Keep these manuals handy while you're installing and learning
to use WordScan:


/ Installation Notes for Scanners and Fax Cards provides
needed information about configuring other companies’
products for use with WordScan.


/ Microsoft Windows [3.0] User’s Guide explains all of the
operations you can do with Windows version 3.0. You
should know the basics of using Windows before you
start using WordScan.


Chapter 1


Introducing WordScan


What Is WordScan?


WordScan is an optical character recognition (OCR) program
that can save you hours of retyping time on every printed
document you need to process. It brings you the power and
accuracy of Calera’s best recognition with unparalleled
ease-of-use.


With WordScan, you can do simple automatic recognition, or
you can preview a document prior to recognition, drawing a
box around an area you want to capture. After recognition,
Calera’s Verifier can help you check and correct your document
by instantly taking you to each uncertain character or word.


You can use WordScan to automatically read a stack of printed
documents and convert them into the format required by your
word processing, spreadsheet, or database program. Or, you
can deal with each page interactively, setting up a text, image,
or numeric zone, to process only a specific area of the page.
Or, you can repeatedly read a selected area of multiple pages
by using templates.


Why Is OCR Important?


WordScan works in conjunction with your desktop scanner or
fax card to recognize the letters and numbers on a page and
convert them into information your computer can use.


Simply put, a scanner takes an electronic picture of a page.
That's all it does. You cannot edit or manipulate a picture of
letters and numbers with your word processing, spreadsheet, or
database program.


To get meaningful data from a picture of a page, you need an 
OCR system like WordScan. An OCR system evaluates the 
picture of a page and finds shapes that might be letters or 
numbers. Different OCR systems use different means to 
determine which shapes constitute the various characters. 
WordScan is an omnifont system which means it can recognize 
virtually any typeface used in business and legal documents, 
newspapers and magazines, books and brochures. 


But just finding characters on the page is not enough.
WordScan performs additional steps to provide complete
document recognition. During recognition, WordScan
compares the characters it finds to a built-in dictionary to see if
they form actual words. This increases the accuracy of the
resulting document. WordScan also evaluates the layout of the
page, saving important formatting information. WordScan can
handle documents with multiple columns, documents with a
wide range of character sizes, and documents in various
orientations (landscape, portrait, upside-down). You can also
capture images (pictures, photos, line art) from your pages and
save them in your choice of popular image formats.


WordScan Highlights


WordScan helps you complete your recognition jobs quickly
and easily with the following features:


/ Automatic mode quickly reads full pages into the 
application format you specify. 


/ A Preview window lets you define what area of the 
page you want to capture when you don’t want it all. 


/ Character and word marking means you can find 
uncertain characters and words quickly, using either the 
built-in Verifier or your own word processing program. 


/ The Verifier helps you check and quickly correct your 
document for recognition errors by showing you which 
words and characters were uncertain to WordScan. 


/ Double-sided recognition lets you scan a stack of 
double-sided pages. Scan one side, turn the stack over, 
then scan the other side. Your pages end up in the 
correct order automatically. 


/ User-defined recognition dictionaries mean you can 
add your own terms to a dictionary so they are not 
marked as uncertain. 


/ Templates give you the ability to save a page area 
(zone) that you want to capture repeatedly and apply it 
to future jobs. 


/ Deferred processing means you can quickly scan
documents, then perform the more time-consuming 
recognition task later, perhaps while you’re out to lunch 
or after you've gone home. 


/ Customized settings let you save all your settings,
dictionaries, and templates for future use. 


/ Multiple column recognition helps you process 
columns of text and automatically put them in the right 
order in your finished document. 


/ Comprehensive format retention retains page
formatting, centered, bold, italic, and underlined text,
indented paragraphs, hanging indents, justification, and
margins, so you don’t have to spend a lot of time
reformatting.


System Requirements


To use WordScan, you need:


PC/AT or PS/2       An IBM PC/AT or true compatible or an IBM PS/2
                    MCA or true compatible. (An 80286 or higher
                    processor is required. An 80386 or higher is
                    recommended.)

2 MB Memory         At least 2 megabytes of total system memory.

Hard Disk w/6 MB    A hard disk with at least 6 megabytes of unused
                    space.

Graphics Monitor    Microsoft Windows 3.0-compatible graphics display.

Mouse               Microsoft Windows 3.0-compatible mouse or other
                    pointing device.

DOS 3.1             PC-DOS or MS-DOS version 3.1 or higher.

Windows 3.0         Microsoft Windows version 3.0.

Scanner             WordScan-compatible image scanner and interface
                    card.


The next chapter gives you step-by-step instructions on how to 
install WordScan. If someone else has already installed 
WordScan for you, skip Chapter 2 and proceed to Chapter 3. 


Chapter 2

Installing WordScan


This chapter explains how to install WordScan. You need to:

/ Install your scanner.
/ Install Microsoft Windows version 3.0.
/ Install the WordScan program.


Installing Your Scanner

Install your scanner and its interface card according to the
instructions supplied with them. Also, check Installation Notes
for Scanners and Fax Cards supplied with WordScan for
additional important information.

Before proceeding, make sure your scanner is working. Use the
software supplied with the scanner to scan some images.


Installing Windows 3.0

Install Microsoft Windows 3.0 according to the instructions
included with it.

Important: When you finish installing Windows 3.0,
reboot your computer so that any changes to your
CONFIG.SYS and AUTOEXEC.BAT files take effect.

Make sure Windows is working properly before proceeding by
running one or more of the programs included with it (such as
Paintbrush, Notepad, Calculator, or Clock). Running these
programs will ensure that Windows is installed properly and
give you practice using Windows programs.


Installing the WordScan Program

1. Insert the WordScan #1 Installation disk into Drive A.
2. From the DOS prompt, type:

a:install <Enter>



3. Follow the instructions on your screen to complete installation.
You'll be prompted to choose a program directory and
a temporary file directory. Normally, you'll want to choose
the names suggested by the installation program.


4. When installation is complete, put the WordScan disks
away in a safe location.


5. Reboot your computer. Press <Ctrl> + <Alt> + <Del> or turn
the power off and on again to reboot. (Failure to reboot
may result in an error condition when you launch WordScan.)


Installation is done! You’re now ready to launch WordScan and 
get to work. Go on to Chapter 3 for a quick tutorial on getting 
started with WordScan. 


Troubleshooting

If you encountered any problems during installation, consult
Appendix C, “Troubleshooting,” for information on particular 
types of problems and their solutions, as well as information on 
using the Technical Assistant diagnostic program. 


Getting the Latest Information

There may have been minor changes in a feature or procedure
between the time this manual was printed and the time the
WordScan program was finished. If there are any such changes,
they are explained in a file called README.TXT located in the
C:\CALERA\DIAG directory (or the \DIAG subdirectory of the
directory in which you installed WordScan). To view this file,
type the following command at the DOS prompt:


type c:\calera\diag\readme.txt | more 


If no changes took place, there will be no README.TXT file 
installed on your hard disk. 




Chapter 3

Getting Started


To get you started with WordScan, this chapter leads you
through two simple examples. In the first, you’ll scan a 
one-page document for text only. In the second, you'll use 
Preview mode to capture just a part of the page. 


Important: If you are not the first person to use WordScan,
the default settings may have been changed. Certain settings
affect the outcome of this tutorial. If you suspect changes
have been made, you should check your settings
against the factory default settings (see Appendix A).


Before You Start

Before you launch WordScan, and only if you chose not to
have the installation program modify your computer's
AUTOEXEC.BAT file, you must type the following command
prior to starting Windows and launching WordScan:


wordinit <Enter> 


This command sets the path for proper operation. When the 
DOS prompt (C:) returns, you can proceed to launch Windows 
and WordScan as described on page 12. 


Note: You need to repeat the above command each time
you reboot or power on your computer. You do not need 
to enter this command if you quit WordScan and then 
launch it again without rebooting your computer. 


WordScan has many capabilities that are not fully explained in 
this tutorial section. If you want to learn every detail of how to 
use these capabilities, you should read the remainder of this 
manual. However, if you learn enough from this tutorial to 
satisfy your current recognition needs, you can continue using 
WordScan without spending more time reading. 


When you come across a capability or feature you don't know
how to use, consult the on-line Help system. Choose a topic
from the Help menu or choose Index to see a list of all topics.
You'll find detailed information on using every WordScan
feature, button, menu, and setting.

[Figure: the Help menu, with the entries Setup Window, Preview Window, Menus, How To, Using Help, Technical Assistant, and About WordScan...]



Processing Your First Document

For the first example, you'll process a one-page document for
text only. Use the Tutorial Sample Page #1 included with
WordScan. It looks like this:


Tutorial Sample Page #1 


Congratulations on buying a Calera recognition product! You 
have joined most of the Fortune 500 in purchasing the most 
accurate character recognition available today. 


This product will read typewritten, typeset, laser print, and 
even draft-quality dot-matrix documents. 


Our products are used in many different applications. Here 
are some examples: 


1. Word Processing: Scanning reports, memos, contracts and
even correspondence.


2. Desktop Publishing: Generating newsletters by scanning
articles from many different sources, revising presentations,
updating technical manuals.


3. Full-Text Databases: Capturing competitive information,
storing magazine and journal articles, scanning resumes.


4. Spreadsheets: Consolidating forecasts and budgets,
analyzing competitive financial statements.


You may have purchased this product with one application in 
mind, but we think you'll quickly find other uses for it! Calera 
Recognition Systems provides character recognition products for 
all your needs. 
Launch WordScan

1. To launch WordScan, type:

win /s wordscan <Enter>

from the DOS prompt. (This starts Windows in "standard"
mode, which results in faster operation, sometimes up to
twice as fast, by disabling virtual memory.) To launch
WordScan while Windows is already running, find its icon
in the WordScan group and double-click it. In a moment,
you'll see the WordScan Setup window, shown below. The
icons in this window let you choose the most common settings
for day-to-day recognition. When you click an icon,
the descriptive text to its right changes to indicate the
selected setting.


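[Figure: the Setup window, titled "Setup Window - Untitled (0 pages)", with Reading for set to Text Only, Page Orientation set to Portrait, Type Quality set to Normal, Image Source set to Scanner, and Brightness at 50%, above the Start Processing buttons.]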


Change Settings

2. Try it. Click the various icons in the Setup window and
notice how the descriptions change. After you get a feel for 
how the icons work, make sure your settings match those 
shown in the Setup window above. 


Load the Scanner

3. Put your document into the scanner in the proper orientation.
Consult your scanner's documentation and look at the
labels on the scanner itself to determine how to orient the
page.


Start Processing

4. Click Automatic to begin processing your document. Most
scanners have a device that informs WordScan when one or more
sheets of paper are loaded. If your scanner has this feature,
scanning will now begin. If you are using a scanner without
this feature, the following dialog box may appear:

[Figure: Load Scanner dialog box, with Starting Page (1), a choice between All pages in stack and Number of Pages, and Paper Path (Flatbed or Feeder).]

If you see this dialog box, just click Scan to begin scanning.
After the scanner finishes scanning the page, you'll see the
Progress Monitor.

[Figure: Progress Monitor for Text Only, showing "page 1: 36% done" and a Cancel Processing button.]


Automatic Processing

You'll generally use Automatic mode unless you want to process
only certain portions of the page by defining a zone. The next
example explains how to use the Preview window to define a zone.


The Progress Monitor changes color from the top down as
recognition proceeds, like someone is pulling down a window
shade. When the "window shade" reaches the bottom,
recognition for that page is complete.


Important: If the image of the page in the Progress
Monitor appears upside-down or sideways, click Cancel
Processing and repeat Step 4. This time, put the page into
the scanner so that it will appear right side up in the
Progress Monitor. If you continue to recognize a page that
is incorrectly oriented, recognition will take a long, long
time and will result in a page of jumbled characters that
have nothing to do with the original document.


When processing finishes, one of the following dialog
boxes appears:

/ If you have a scanner with a sheet feeder:

[Figure: Scanner Empty alert box, with the buttons Stop Scanning, Add Pages, and Turn Pages Over.]

/ If you have a flatbed scanner:

[Figure: Load Scanner dialog box, with Starting Page (1), All pages in stack, and Paper Path (Flatbed or Feeder).]


Stop Processing

5. Click Stop Scanning or Done depending on which of the
above messages appears. Recognition is now done and
WordScan has saved your document in its own internal format
for proofing.
Verify Your Document

6. Choose Verifier from the Options menu to look at your
recognized document and make any needed changes and
corrections. The Verifier window appears with your page
loaded and ready to proof.

[Figure: the Options menu (Text Settings..., Scanner Settings..., Verifier..., User Dictionary..., Preview Settings..., Defer Processing, Process Deferred Docs...) and the Verifier window (menus File, Edit, Search, Character, Options, Help) displaying the recognized text of Tutorial Sample Page #1.]



a. Check the page for misrecognized characters and
words. If a correction is needed, use the arrow keys to
position the cursor and then type your correction. Use
<Backspace> and/or <Delete> to remove unwanted
characters.


b. Use the scroll bars to shift the page up and down as
well as left and right so that you can see the entire
page. Keep in mind that the document you see has not
yet been formatted. The text appears in the same relative
position as it did in the original document. WordScan
will correctly reformat the text when it saves the
finished document in your choice of text formats.
[Figure: the Verifier's File menu, with the commands Next Page (Ctrl+N), Go to Page... (Ctrl+O), Cut Page, Paste Page, Add to Dictionary (Ctrl+A), and Exit Verifier.]


c. When you have checked the entire page, choose Exit
from the File menu. You'll then see the following
prompt:

[Figure: alert box asking "Save current changes?"]

d. Click Yes to save your changes and close the Verifier
window. The Verifier saves the changes you made to
your document in WordScan's internal format.


Save Your Finished Document

7. Now you are ready to save your finished document.
Choose Save from the File menu. You'll see the following
dialog box:

[Figure: Save As Document dialog box, with a Document Name box, the current Directory (c:\calera\bin), a Directories list, drop-down lists for Text (ASCII) and Image (PageMaker (TIFF)), and the check boxes Delete All Intermediate Files and Save Markers As. The Text list includes Manuscript 2.1, Microsoft RTF, Microsoft Word 3.x, 4.0, and 5.0, and MultiMate 3.1, 3.3, and 3.6.]

a. Select the desired text format for your document from
the drop-down list box next to Text. For this example,
choose the format that will work with one of your application
programs. (If you choose Microsoft Word 4.0,
you can open the document using Microsoft Write, the
word processing program included with Windows.)


b. Type the name sample in the Document Name box,
then click Save. (If you want to save the document in a
different directory from the one displayed next to the
Directory prompt, type the complete pathname in the
Document Name box.)

A progress monitor appears showing the progress of
the conversion and save process.

[Figure: Progress Monitor for the Microsoft Word 4.0 conversion, showing "10% done" and a Cancel Processing button.]


Exit WordScan

8. When processing completes, choose Exit from the File
menu to exit WordScan.

[Figure: the File menu, with the commands New, Open..., Save..., Save Unconverted..., Retrieve Settings..., Save Settings..., Retrieve Template..., and Save Template...]


Congratulations—you’re done! WordScan has scanned your 
page, recognized the text on it, and let you proof the page and 
save the results in a file on your disk, converting it to the word 
processing format of your choice. You can now bring up your 
word processing program and load the file sample. 


Processing Your Second Document


Now, let’s try an example that shows more of WordScan’s 
versatility. This time, use the Tutorial Sample Page #2. It 
looks like this: 


Enter Text Retrieval 
Databases 


Let’s go back a few years to 
the famous Scopes "Monkey 
Trial” of 1925 when dozens of 
law clerks must have waded 
through hundreds of dusty 
volumes so that chapter and 
verse, theory and counter 
theory, could be quoted 
accurately. "Labor intensive" 
hardly begins to describe 
these efforts. 

Now, imagine events a few 
short years from now. 
Scientists are designing a 
self-contained space habitat, 
and must enter thousands of 
research documents into a 
knowledge base of 
unprecedented sophistication. 
Or, researchers of the 21st 
Century need to sift through a 
vast library of medical 
journals for seemingly 
random connections to come 
up with a cure for the 
common cold. 


Tutorial Sample Page #2 


There is a common thread 
through these events. The 
historical example would 
have benefited from the 
ability to look through masses 
of data for common 
connections where no 
structured database existed. 
And, the achievements of the 
future may not be feasible 
without technologies now 
known as text retrieval 
databases using full-text 
indexing. 

What exactly does a text 
retrieval database do for your 
productivity? In a nutshell, it 
allows you to collect, locate 
and organize information 
with unprecedented speed, 
accuracy and flexibility. The 
applications are unlimited. 


Text Retrieval Databases 
vs. Relational Databases 
A relational database uses 
fields to organize data. Each 
field has one entry usually 


represented by one word. 
Text retrieval databases, on 
the other hand, are not 
restricted to information 
contained in a limited number 
of fields. Rather, they rely on 
key words which give the 
user far greater flexibility. 
Any word in the entire 
document can be a key word 
and you don’t have to decide 
ahead of time. 


Calera’s Day in Court 


Not so long ago, even the 
simplest litigation process 
required hours of research in 
a law library, days of 
laborious keyboard entry into 
a litigation support database, 
and sometimes court delays 
while evidence was 
cross-referenced. 

With OCR and a litigation 
support package, every 
document important to a case
can be placed in a database 
with speed and accuracy. 

An example of its uses is 
illustrated by the Association 
of Trial Lawyers of America. 
This organization uses Calera 
products for educational 
materials, resource 
documents and research 
information-sharing cases. 
The organizations first project 
was to produce a 1200 page 
book for its annual 
convention. The association 
estimates that they saved 
$30,000 by eliminating 
contractor costs for typing, 
typesetting and layout. 


TP-00139-0-A 
Notice that the page has an illustration and an article divided
into three sections. In this example, you'll use Preview mode to 
capture one of the sections. 


Launch WordScan

1. Launch WordScan as you did in the previous example.


Prepare to Scan

2. Do these steps to prepare to scan the second sample page:

a. Choose Text Settings from the Options menu. Click
the two check boxes Mark Suspicious Characters
and Mark Suspicious Words. The dialog box should
look like this:

[Figure: Text Settings dialog box, with Mark Suspicious Characters and Mark Suspicious Words both checked and Format set to ASCII.]

b. Click OK to confirm your settings.

c. Put Sample #2 into your scanner.


Start Processing

3. Click Preview to start processing. If your scanner doesn't
have a sheet feeder, or if its sheet feeder doesn't indicate
when pages have been loaded, you'll see the following
dialog box:

[Figure: Load Scanner dialog box, with Starting Page (1), a choice between All pages in stack and Number of Pages, and Paper Path (Flatbed or Feeder).]

Click Scan to proceed.

In a moment, the Preview window appears with part of
your document visible, as shown on the following page.
[Figure: the Calera WordScan window (menus File, Edit, Options, Zoom, Template, Help) with the Setup window and the Preview window open; callouts mark the Maximize icon of each window.]

Make sure the page appears right side up. If it doesn't,
orient the page correctly, and then scan it again by clicking
the ReScan button in the upper right corner of the Preview
window.


Maximize the Preview Window

4. It's easier to define a zone when you can see more of the
page at one time. To see the biggest portion of the page,
maximize the Preview window. Click the Maximize icon in
the upper right corner of the Setup window. Then click the
Maximize icon in the upper right corner of the Preview window.

Use the various scaling commands from the Zoom menu to
get a good view of the page image.

[Figure: the Zoom menu, with the commands Fit To Window, Quarter Scale, Half Scale, and Full Scale (checked).]


Define Text Zones

5. Using the Text tool, define a text zone to capture the text of
the third section on this page, as follows:


a. The mouse pointer is now a crosshair. Move the pointer 
to the upper left corner of the heading in column 3. 


b. Press and hold down the mouse button, then drag the 
mouse down and to the right to draw a rectangle 
around the text you want to include in the zone. 


When you finish, your screen should look similar to this:

[Figure: the Preview window with a rectangle drawn around the third section of the page; a callout labels the rectangle "Text Zone".]

Note that the zone is marked TXT indicating it is a text
zone.


Process the Page

7. Click Process One to process the page. The Progress
Monitor appears, but this time, only the zone you defined
will show progress as recognition occurs.


[Figure: Progress Monitor for Template processing, showing "page 1: 74% done" and a Cancel Processing button; a callout reads "The text zone is being processed."]

When processing is complete, you'll see one of the following
alert boxes, depending on your scanner:

[Figure: Scanner Empty alert box (Add Pages, Turn Pages Over) and Load Scanner dialog box (Starting Page, All pages in stack, Paper Path).]

Click Stop Scanning or Done and the Preview window
will close. WordScan has saved the contents of your text
zone in its internal format for proofing.


Verify Your Document

8. You may now proof and save your document. However,
first open a user dictionary so that you can add words that
are correctly spelled but have been marked as uncertain.

a. Choose User Dictionary from the Options menu.
You'll see this dialog box:

[Figure: User Dictionary dialog box, with a Dictionaries Available list, a Words in Dictionary list, a Word Defined box, and a Close button.]

b. Click New and type a name for your dictionary. For this
example, type the name tutor, then press <Enter>.

[Figure: prompt for the New Dictionary Name.]

c. The name of your new dictionary now appears in the
Dictionaries Available list. Click the name so that it is
highlighted, then click Open.


[Figure: User Dictionary dialog box with the TUTOR dictionary open, showing the Dictionaries Available list, the Words in Dictionary list, a Delete button, and the Word Defined box.]

While the User Dictionary dialog box is on your screen,
you can add words to the dictionary by typing them
into the Word Defined box. For complete information
on user dictionaries, see page 38.

d. Click OK to close the User Dictionary dialog box.


9. Now, proof your document just like you did for Sample
Document #1, but this time, add words that are marked as
uncertain but spelled correctly. Here is a brief summary of
the steps:

a. Choose Verifier from the Options menu.

b. Choose Look for Marker from the Search menu, or
use the shortcut <Ctrl> + <L>. The cursor moves to
the first marked character or word.

[Figure: the Search menu, with the commands Look for Marker (Ctrl+L), Find..., and Find Next (F3).]
c. Make corrections if necessary, or add correctly spelled
words to the dictionary by choosing Add To Dictionary
from the File menu (or by pressing <Ctrl> + <A>).
After the word is added, the Verifier removes the
marker. Press <Ctrl> + <L> to find any subsequent
uncertain characters and words.

[Figure: the Verifier's File menu, with the commands Next Page (Ctrl+N), Go to Page... (Ctrl+O), Cut Page, Paste Page, Add to Dictionary (Ctrl+A), and Exit Pop-Up Verifier.]

d. Repeat Step c until you see the message:

No more markers

at the bottom of the Verifier window.

e. Press <Ctrl> + <N> to instruct the Verifier to save the
changes you made to this page and load the next page
of your document. Since your document has only one
page, you'll see the following alert box:

[Figure: Verifier alert box asking "No more pages, exit Verifier?"]

f. Click Yes to return to the Setup window.


Save Your Document

10. Now you are ready to save your finished document.
Choose Save from the File menu. You'll see the following
dialog box:

[Figure: the File menu (New, Open..., Save..., Retrieve Settings..., Save Settings..., Retrieve Template..., Save Template...) and the Save As Document dialog box, with a Document Name box, a Save button, the current Directory (c:\calera\bin), a Directories list, drop-down lists for Text (ASCII) and Image (PageMaker (TIFF)), and the check boxes Delete All Intermediate Files and Save Markers As.]

a. Select the desired text format for your document from
the drop-down list box next to Text. Choose the format
that will work with one of your application programs.

b. Type the name preview in the Document Name box,
then click Save. (If you want to save the document in a
different directory from the one displayed next to the
Directory prompt, type the complete pathname in the
Document Name box.)


Exit WordScan

11. When processing completes, choose Exit from the File
menu to exit WordScan.
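
The marker review in step 9 amounts to a simple loop: jump to the next marker, then either retype the word or add it to the dictionary. The Python sketch below only illustrates that procedure; it is not WordScan's code. The function name review_markers, the (text, marked) pairs, and the corrections table are all assumptions made for this example.

```python
def review_markers(words, corrections, dictionary):
    """Visit each marked word in order, the way Look for Marker does.

    words       -- list of (text, marked) pairs (hypothetical model of a page)
    corrections -- dict mapping a misrecognized word to what the user retypes
    dictionary  -- set standing in for the user dictionary (updated in place)
    Returns the proofed words with every marker cleared.
    """
    proofed = []
    for text, marked in words:
        if marked:
            if text in corrections:
                # Misrecognized word: the user types the correction.
                text = corrections[text]
            else:
                # Correctly spelled word: Add To Dictionary.
                dictionary.add(text)
        # Once handled, the word no longer carries a marker.
        proofed.append(text)
    return proofed


page = [("Calera", True), ("uhile", True), ("evidence", False)]
user_dictionary = set()
result = review_markers(page, {"uhile": "while"}, user_dictionary)
print(result)           # ['Calera', 'while', 'evidence']
print(user_dictionary)  # {'Calera'}
```

In this model a word that is added to the dictionary keeps its spelling, while a corrected word is replaced and is not added.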
A Tour of the Setup Window


This chapter tells you how to use many of the Setup window
settings and menu choices to perform recognition in Automatic
mode including:

/ Making basic settings via the Setup window.
/ Using Automatic mode.
/ Saving and retrieving your work and your settings.
/ Changing text and scanner settings.
/ Defining and using user dictionaries.


By learning some of these basic operations, you can improve 
the quality of recognition by adjusting certain settings and by 
supplying your own terms for WordScan to use during 
recognition. 


The Setup window lets you change common WordScan
settings. In Chapter 3, you checked your settings against a
picture of the Setup window to make sure that everything was
set to "normal." Now you'll learn how each setting affects
recognition.

[Figure: the Setup window, titled "Setup Window - Untitled (0 pages)", showing the groups Reading for (Text Only), Page Orientation (Portrait), Type Quality (Normal), Image Source (Scanner), Brightness (50%, between darken and lighten), and Start Processing.]


The Setup window is divided into six groups. Only one icon 
from each group can be selected at a time. You make a setting 
by pointing to the icon that depicts what you want and clicking 
the mouse button. WordScan highlights that icon and displays a 
text description of the setting. 


Reading for

These icons let you specify what you want WordScan to
process:

Text            The text (only) of each page is recognized. Areas
                of the page not recognized as text (such as
                illustrations and photos) are ignored. Unless
                otherwise specified, the text of the entire
                document is saved as one file when you choose
                the Save command.

Image           A full page image is captured for each page
                scanned. No recognition is performed. Each entire
                page (both text and illustrations) is saved as a
                separate image file and cannot be edited in
                text-based applications.

Text & Image    The text of each page is recognized and an
                image of the entire page is captured. You'll end
                up with one file containing all of the text, plus a
                separate file for each page's image.

Template        Pages are processed using a template you
                previously defined. Templates specify areas of the
                page (zones) to capture and process as either text,
                image, text and image, or numbers. For more
                information on using templates, see page 88.

Page Orientation

These icons let you select the orientation of your pages; that is,
the direction in which the text appears relative to the direction
of the paper.

Important: Choosing the correct orientation is critical to
successful recognition. If you choose the wrong orientation,
you'll end up with a document full of meaningless text and
symbols after a very long recognition time. Always choose
the orientation that results in text appearing right side up in
the Progress Monitor or Preview window.

Portrait        Text reads from top to bottom when you hold the
                page with its narrowest dimension at the top and
                bottom, like a portrait picture.

Landscape       Text reads from top to bottom when you hold the
                page with its widest dimension at the top and
                bottom, like a landscape picture.

Flipped         Upside-down portrait orientation. Generally, you'll
                use this setting when you have no control over how a
                document was placed in the scanner, such as when
                reading image files from disk and files from a fax card.

FlipScape       Upside-down landscape orientation. Again, this
                setting is generally used when you have no control
                over the orientation of the original document, such
                as when reading image files and fax files from disk.


Type Quality

These two icons let you select the type quality of characters
used in your original document.

Normal          Normal printed text, including typeset and
                laser-printed documents. Always use this setting
                for any document that contains
                proportionally-spaced text (like Times Roman or
                Helvetica).

Dot Matrix      Draft-quality dot-matrix documents and
                documents that are entirely in a monospaced font
                (like Courier). It is especially important that you
                use this setting when your document is
                draft-quality dot-matrix.


Image Source

These three icons select where the page images will come from.

Scanner         Use your scanner as the image source.

Disk            Obtain images from disk files.

Fax             Obtain images from files saved by your fax card.

Note: WordScan only uses files saved by the fax card. It
does not let you interact directly with the fax card to send
and receive faxes.


Brightness

This slide bar lets you lighten and darken pages to compensate
for less-than-perfect originals. You must adjust the brightness
setting before scanning. Changing this setting does not affect an
image that has already been scanned or a page that is stored on
disk.

[Figure: the Brightness slide bar, running from darken to lighten.]


To set the brightness, you can: 


/ Click the left or right arrows to change the setting in 
small steps. 


/ Click to the left or right of the white square to change the 
setting in large steps. If your scanner has preset “darker” 
and “lighter” settings, the displayed percentage will 
change to Darker or Lighter, accordingly. 


/ Click and drag the white square to the desired setting. 


/ If your scanner has an “automatic” brightness setting, 
clicking the “sun” icon will enable automatic brightness 
and change the displayed percentage to Auto. 
Changing the brightness has the same effect as on a
photocopier. Choose darken when scanning light pages, and 
choose lighten when scanning dark pages. 


Important: The range of the Brightness setting depends on
the capabilities of your scanner. Some scanners have very
good brightness control, while others offer very little
improvement. For more information on your particular
scanner, consult the Installation Notes for Scanners and Fax
Cards included with WordScan.


Start Processing

The two buttons under Start Processing select either Automatic
mode or Preview mode. Automatic mode is explained in the
following section. Preview mode operations are explained in
Chapter 5.
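
The rule that only one icon per group can be selected can be pictured as a small table of choices. The Python sketch below is an illustration only, not part of WordScan: the names ICON_GROUPS and choose are made up for this example, the icon names are taken from the descriptions above, and Brightness is assumed to be stored as a plain percentage.

```python
# Icon groups and their icons, as described in this chapter.
ICON_GROUPS = {
    "Reading for": ["Text", "Image", "Text & Image", "Template"],
    "Page Orientation": ["Portrait", "Landscape", "Flipped", "FlipScape"],
    "Type Quality": ["Normal", "Dot Matrix"],
    "Image Source": ["Scanner", "Disk", "Fax"],
}


def choose(settings, group, icon):
    """Return a copy of settings with icon selected in group.

    Selecting an icon replaces the earlier choice in the same group,
    so each group always holds exactly one selection.
    """
    if group not in ICON_GROUPS:
        raise ValueError(f"unknown group: {group!r}")
    if icon not in ICON_GROUPS[group]:
        raise ValueError(f"{icon!r} is not an icon in the {group!r} group")
    updated = dict(settings)
    updated[group] = icon
    return updated


# The settings used in the Chapter 3 tutorial (brightness assumed to be a percentage).
settings = {
    "Reading for": "Text",
    "Page Orientation": "Portrait",
    "Type Quality": "Normal",
    "Image Source": "Scanner",
    "Brightness": 50,
}

# Reading an upside-down fax file: change two groups, leave the rest alone.
settings = choose(settings, "Image Source", "Fax")
settings = choose(settings, "Page Orientation", "Flipped")
print(settings["Image Source"], settings["Page Orientation"])  # Fax Flipped
```

Because choose returns a new dictionary, an earlier set of settings stays available, much as a saved Settings file would.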


Automatic Mode 


Use Automatic mode when you want to recognize complete 
pages of text, image, or both and you don’t want to preview the 
page image first. This is the fastest and most straightforward 
way to process full pages of text and/or image. 


Click Automatic to start processing in Automatic mode. 


Normally, WordScan will scan the first page of your document,
then display the Progress Monitor:

[Figure: Progress Monitor for Text Only, showing "page 1: 36% done" and a Cancel Processing button.]


As recognition progresses, the image in the Progress Monitor is
shaded to indicate recognition progress. When the Progress 
Monitor reaches 100%, recognition is complete. 
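
The shading can be thought of as a band that grows from the top of the page image in step with the percentage shown. The Python sketch below illustrates that idea with a text picture; it is not how WordScan draws its Progress Monitor. The function names and the assumption that shading grows in direct proportion to the percentage are made up for this example.

```python
def shaded_rows(percent_done, image_height):
    """Number of rows to shade, counted from the top of the page image."""
    if not 0 <= percent_done <= 100:
        raise ValueError("percent_done must be between 0 and 100")
    return image_height * percent_done // 100


def render(percent_done, image_height=10, width=12):
    """Draw the page image as text: '#' for shaded rows, '.' for the rest."""
    shaded = shaded_rows(percent_done, image_height)
    rows = ["#" * width if row < shaded else "." * width
            for row in range(image_height)]
    return "\n".join(rows)


# At 36% done, the top 3 of 10 rows are shaded.
print(render(36))
```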


If there are more pages in the scanner, WordScan will scan and 
then recognize each additional page according to the Reading 
for setting. If you are using a flatbed scanner or a scanner that 
can't sense when pages have been inserted, you'll see the 
following dialog box before each page is scanned: 


[Figure: Load Scanner dialog box, with Starting Page, a choice between All pages in stack and Number of Pages, and Paper Path (Flatbed or Feeder).]

If you are using a scanner with a document feeder and the
document feeder becomes empty, you'll see the following:

[Figure: Scanner Empty alert box, with the buttons Stop Scanning, Turn Pages Over, and Add Pages.]

At this point, you can either click Stop Scanning if your job is
complete, Add Pages if you need to process additional pages,
or Turn Pages Over to scan the other side of a double-sided
document.


Double-Sided Recognition

If you have a scanner with an automatic document feeder, you
can take advantage of WordScan's ability to automatically
process double-sided documents. It's easy, and it works in both
Automatic and Preview modes.


1. Load your scanner with as much of the double-sided document
as will fit into the document feeder, so that the front
of each page will be scanned.


2. Begin processing as usual. When the following alert box
appears:

[Figure: Scanner Empty alert box, with the buttons Stop Scanning, Add Pages, and Turn Pages Over.]

You can either:

a. Add more pages (front sides) to the document.

b. Turn the document over and process the back sides of
the pages, so that the last page of the document is
scanned first.


Important: You must scan all of the front sides before you
scan the backs. If you have multiple stacks, when you turn
the document over you must scan the backs of the last
stack of pages first, and work back toward the beginning of
your document.


When you finish your document, the pages will be in the 
correct order. 
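
The scanning order described above (all fronts first, then the backs starting from the end of the document) is what makes it possible to pair each front with its back. The Python sketch below works through that ordering for a five-sheet document split into two stacks. It is an illustration under stated assumptions, not WordScan's actual method: the function names and the "1F"/"1B" labels are made up for this example.

```python
def scan_sequence(stacks):
    """Order in which sides pass through the feeder.

    stacks -- the document split into feeder loads, in document order,
              for example [[1, 2, 3], [4, 5]] for five sheets.
    """
    # All fronts first, stack by stack, in document order.
    fronts = [f"{sheet}F" for stack in stacks for sheet in stack]
    # Then the backs: last stack first, and last sheet first within a stack.
    backs = [f"{sheet}B"
             for stack in reversed(stacks)
             for sheet in reversed(stack)]
    return fronts + backs


def reassemble(scanned):
    """Pair each front with its back, knowing the backs arrived in reverse."""
    if len(scanned) % 2:
        raise ValueError("expected one back for every front")
    half = len(scanned) // 2
    fronts = scanned[:half]
    backs = scanned[half:][::-1]
    ordered = []
    for front, back in zip(fronts, backs):
        ordered.extend([front, back])
    return ordered


sequence = scan_sequence([[1, 2, 3], [4, 5]])
print(" ".join(sequence))              # 1F 2F 3F 4F 5F 5B 4B 3B 2B 1B
print(" ".join(reassemble(sequence)))  # 1F 1B 2F 2B 3F 3B 4F 4B 5F 5B
```

If a back stack were scanned out of turn, the pairing in reassemble would match fronts with the wrong backs, which is why the order of the stacks matters.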


Text Settings 


Text settings affect how text is processed. To see or change text 
settings, choose Text Settings from the Options menu. The 
Text Settings dialog box appears. 


[Options menu: Text Settings..., Scanner Settings..., Verifier...,
User Dictionary..., Preview Settings..., Defer Processing,
Process Deferred Docs...]

[Text Settings dialog box: Mark Suspicious Characters, Mark
Suspicious Words, Format: ASCII]


Mark Suspicious Characters

When enabled (X in the box), uncertain characters are marked
so that they are highlighted in the Verifier window or marked
for proofing in the word processing format of your choice
when you save your document.


Mark Suspicious Words

When enabled (X in the box), uncertain words are marked so
that they are highlighted in the Verifier window or marked for
proofing in the word processing format of your choice when
you save your document.


Language

If your WordScan came with language options, choose the
language you want to use from this drop-down list box before
processing your pages.


Text Format

Choose the desired text format from this drop-down list box,
preferably one that matches your application program. Most
word processing programs will import ASCII, Decolumnized
ASCII, Rich Text Format (RTF), or Document Content
Architecture/Revisable Form Text (DCA/RFT). Microsoft Write
(the word processor included with Windows 3.0) will import
Microsoft Word 3.x or 4.0 files.


Click OK after you have made the desired settings. Click 
Cancel if you want to revert to the previous settings. 


Note: The changes you make in Text Settings remain in effect
only until you quit WordScan (or until you retrieve a
Settings file). You should save your desired Text Settings
(especially Text Format) so that they will be used
automatically every time you launch WordScan. To do this,
choose Save Settings from the File Menu, as described on
page 46, and save your settings under the name DEFAULT.


Scanner Settings


Scanner settings affect how images are scanned. (These settings 
can affect text recognition.) To change scanner settings, choose 
Scanner Settings from the Options menu. Scanner Settings 
is dimmed when Image Source is set to Disk, because these 
settings affect only images that haven't yet been scanned. 


[Scanner Settings dialog box: Image Type, Contrast (50%), Page
Size (Letter, A4, Legal), Scan Resolution (300 d.p.i.), Image
Format (PageMaker (TIFF))]


Image Type

Choose Binary for scanning pages with text, line art, or
photographs that have already been halftoned (converted to
tiny dots of black and white to represent gray tones). Choose
Dither only for scanning pages with continuous-tone images.
(Continuous-tone images contain grays or colors other than
black and white.) Because recognition requires a high-contrast,
sharply-defined black and white image, WordScan won't let
you process text in either Automatic or Preview mode when
Dither is selected.


Dithering

Dithering is a process of converting gray tones
to a pattern of black and white dots. Dithering is
roughly equivalent to halftoning, but generally
is much coarser due to the limitations of scanning
and printing at 300 dots-per-inch. When an
image is dithered, clusters of dots act as a unit to
represent a particular shade of gray.


If you choose Dither, then select the desired dithering option
from the drop-down list box. The choices depend on your
scanner. Consult the documentation that came with your
scanner to determine the best choice for your images.


Contrast

Use this slide bar to change your scanner’s contrast setting. This
is different from the Brightness setting in the Setup window.

Contrast does not affect the quality of recognition; brightness
does. You change the Contrast setting to improve the quality
of the images you capture. If your scanner does not provide
both a contrast and brightness setting, the Contrast setting is
disabled. Consult your scanner's documentation for more
information about how to use this setting.


✓ Click the left or right arrows to change the setting in
small steps.

✓ Click to the left or right of the white square to change the
setting in large steps. If your scanner has preset “darker”
and “lighter” settings, the displayed percentage will
change to Darker or Lighter, accordingly.

✓ Click and drag the white square to the desired setting.

✓ If your scanner has an “automatic” contrast setting,
clicking the “moon” icon will enable automatic contrast
and change the displayed percentage to Auto.


Scan Resolution

Choose the desired scan resolution from this drop-down list
box. Choices depend on the scanner you are using. Generally,
you want to use 300 dots-per-inch (dpi) when scanning for
text. You might want to use another setting for special
recognition situations or for capturing images.


Image Format

Choose the image format that matches or is supported by your
application program.


Click OK after you have made the desired settings. Click 
Cancel if you want to revert to the previous settings. 


Note: The changes you make to the Scanner Settings
remain in effect until you quit WordScan, or until you
retrieve a Settings file. If you want to save your new Scanner
Settings, choose Save Settings from the File Menu, as
described on page 46.

Page Size

Choose the page size you are scanning. Select Letter to scan
8½" x 11" pages, Legal to scan 8½" x 14" pages, or A4 to scan
21cm x 29.7cm pages.
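
To give a sense of how Page Size and Scan Resolution combine, the
following sketch works out the size of a scanned page in dots. It
is plain arithmetic on the page dimensions given above; the
function and table names are hypothetical, and actual scanners may
capture a slightly different area.

```python
CM_PER_INCH = 2.54

# Page dimensions in inches, taken from the Page Size choices.
PAGE_SIZES_IN = {
    "Letter": (8.5, 11.0),
    "Legal": (8.5, 14.0),
    "A4": (21.0 / CM_PER_INCH, 29.7 / CM_PER_INCH),
}


def page_size_in_dots(width_in, height_in, dpi=300):
    """Return (width, height) of the scanned page in dots."""
    return round(width_in * dpi), round(height_in * dpi)


for name, (width_in, height_in) in PAGE_SIZES_IN.items():
    print(name, page_size_in_dots(width_in, height_in))
```

At 300 dpi a Letter page works out to 2550 by 3300 dots, which is
one reason larger pages and higher resolutions take longer to
transfer.
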
User Dictionaries

WordScan has a large built-in dictionary that it uses during
recognition as one of many steps in determining the certainty
of the words it recognizes. In addition, WordScan lets you
define your own user dictionaries to supplement the built-in
dictionary. If, when proofing your document, you consistently
encounter marked words that are spelled correctly, you can
prevent future marking by adding these words to a user
dictionary.


To define a user dictionary, choose User Dictionary from the 
Options menu. 


[User Dictionary dialog box: Dictionaries Available list, Words
in Dictionary list, Word Defined box]


New

Click New to create a new dictionary. The following dialog box
appears:

[Dialog box: New Dictionary Name]


Type a name for your dictionary using up to eight characters. 
You should name your dictionaries to indicate what kinds of 
words they contain. For example, you might name a dictionary 
of legal terms LEGAL. After you enter a name, the new 
dictionary appears in the list of Dictionaries Available. 


Open

Before you can add words to a dictionary or use it for
recognition, it must be opened. To open a dictionary, first click
its name to select it, then click Open. The words in the
dictionary then appear in the Words in Dictionary list.


The recognition software has a built-in dictionary
of approximately 75,000 words that is
used at all times. Each user dictionary that you
define can contain up to 1,000 additional words.
You can define as many user dictionaries as you
want; however, only one user dictionary can be
in use at a time.


Delete (dictionary)

The upper Delete button erases all the words in the selected
dictionary from your disk. To delete a dictionary, first click its
name to select it, then click Delete.


Import

WordScan lets you import a list of words into a dictionary. This
lets you use, for example, an existing dictionary from your word
processor. The word list you import must be a plain text
(ASCII) file with each word separated from other words by a
space, a return, or a tab. To import the word list, first open an
existing dictionary (or create a new dictionary and open it, if
desired), then click Import.


[Import Dictionary dialog box: Files and Directories lists, with
OK and Cancel buttons]


Use the list box to select the word list you want to import, then 
click OK. WordScan imports your words, then sorts and adds 
them to the open dictionary. 
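
The word list format described above (plain ASCII text, words
separated by spaces, returns, or tabs) can be sketched as follows.
This is an illustration only: the function name merge_word_list is
hypothetical, and the way it handles duplicates, letter case, and
the 1,000-word limit is an assumption, since the manual does not
say how WordScan treats these cases.

```python
def merge_word_list(text, existing=(), limit=1000):
    """Merge a whitespace-separated word list into a sorted dictionary.

    text: contents of the plain text word list file.
    existing: words already in the open dictionary.
    limit: assumed maximum number of words in a user dictionary.
    """
    words = set(existing)
    # str.split() with no argument splits on spaces, tabs, and returns.
    words.update(text.split())
    merged = sorted(words, key=lambda w: (w.lower(), w))
    if len(merged) > limit:
        raise ValueError("user dictionary would exceed %d words" % limit)
    return merged


word_list = "tort\testoppel lien\r\nlien"
print(merge_word_list(word_list, existing=["affidavit"]))
```

A word list exported from a word processor can usually be used as
is, provided it is saved as plain text first.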


Add

Once you have opened a dictionary, you can add words to it.
Type the word in the Word Defined box at the bottom of the
dialog box, then click Add. WordScan then adds the word if it
isn’t already in the dictionary.

Press <Enter> after typing each word you want to add,
rather than clicking Add.


Note: You can also add words to the open user dictionary
while you are using the Verifier. For more information, see
page 71.


Replace

You can replace a word in the dictionary with a new word. To
do this, click on the word you want to replace in the Words in
Dictionary list. Then type a new word in the Word Defined
box and click Replace.


Important: When you add a word to a user dictionary, the
word is not used until the next time you recognize a page
with the same user dictionary open. To make sure that a
certain user dictionary is always used, choose Save Settings
from WordScan’s File menu and save your current settings
(including the open dictionary) as default.


Delete (word)

The lower Delete button deletes individual words from the
open dictionary. To delete a word, first select it by clicking in
the Words in Dictionary list. Then click Delete. You also can
delete multiple words at once. Just click on each word you
want to delete to highlight it, then click Delete. If you
accidentally select a word that you don't want to delete, click
on it again to deselect it.


Click OK to save any changes you made to your dictionary. 
WordScan clears the existing user dictionary from memory and 
then loads the words in the currently-opened user dictionary 
for use the next time you process pages. 


Note: If you want to use a particular user dictionary
automatically each time you use WordScan, save your settings
now. Then each time you retrieve the Settings file (or each
time you launch WordScan if you saved your settings as
DEFAULT), you'll see the message:

Loading Dictionary

for a few seconds while the dictionary is transferred into
memory.


Saving and Retrieving Your Work

The File menu has commands for saving a document, opening
a new document, saving and retrieving settings, and quitting
WordScan.

[File menu: Save..., Save Unconverted..., Retrieve Settings...,
Save Settings..., Retrieve Template..., Save Template...]


Saving Your Finished Document

When WordScan finishes processing your document, you must
save it. This creates one or more files of the text and/or images
in the format(s) you specify. If you want to save the same
document in more than one format, you can save it multiple
times, changing the text and image format and the document
name each time.




To save your document, choose Save from the File menu. 
Everything processed since you launched WordScan (or last 
selected New or Save) will be saved. The following dialog box 
appears: 


[Save As Document dialog box: Document Name, Directory
(c:\calera\bin), Image (PageMaker (TIFF)), Delete All
Intermediate Files, Save Markers As]


Directory

This shows the directory where the file will be saved. You
cannot type in this area; it is for display purposes only. See
“Directories” on page 44 to learn how to change the pathname.


Document Name

Type a name for your document using standard DOS file
naming conventions. The name generally can be up to eight
characters long. WordScan supplies a three-character filename
extension appropriate for the text and image formats you have
selected. See “Document Naming” on page 44 for details of
how WordScan names the various parts of your document.
(Note that you can also type in a complete path name in this
box as an alternative to selecting the drive and directory from
the Directories list box.)


Text Format

Choose the desired text format from this drop-down list box.

[Text drop-down list box, showing choices such as Manuscript
2.1, Microsoft RTF, Microsoft Word 3.x, Microsoft Word 4.0,
Microsoft Word 5.0, and MultiMate 3.6]


Image Format

Choose the desired image format from this drop-down list box.

[Image drop-down list box, showing PageMaker (TIFF), PC
Paintbrush (PCX), and PDA]


Delete All Intermediate Files

During recognition, WordScan saves the text and images from
each page in its own internal format. Normally Delete All
Intermediate Files is enabled (X in the box), so WordScan
will delete these files after saving your finished document and
automatically begin a new document.


If you want to save the intermediate files (so that you can add
more pages to the document or so that you can save the
document multiple times using different text and image
formats), disable Delete All Intermediate Files (no X in the
box). Then WordScan will retain the intermediate files and will
not begin a new document after completing the Save operation.


Save Markers As

During recognition, WordScan keeps track of all characters and
words for which recognition was uncertain. You have the
option of marking these uncertain characters and words in your
finished document and specifying the character to use as a
marker. If you want your finished document to contain
markers, enable Save Markers As (X in the box) and type the
desired marker character in the box. You should specify a
character that is used infrequently so that you can use your
program’s search function to quickly find all markers.


Note: Only those uncertain characters and words that have
not been processed by the Verifier will appear in your final
document. Any marked characters or words that you either
corrected or confirmed by using Verifier commands will not
be marked in the final document.


Directories

Specify where to save your finished document in this box. All
directories accessible from the current directory are shown in
the list box, followed by all disk drives. Double-click on the
desired drive and directory names until you have specified the
desired pathname. Double-click on the [..] entry to move up to
the parent directory. (Or, type a complete pathname in the
Document Name box.)


Once you have specified a document name and selected the 
desired options, click Save to save your document. 


Document Naming

WordScan names documents according to the name you
specify in the Save command, but it doesn’t always use the
entire eight-character document name.


When your document includes images, the text filename is not
affected. Image files, however, are named according to the
following pattern:


abcde-nn.ext

where:

abcde   is the first five characters of the filename you
        specified.

nn      is a number indicating the page on which the
        image originally appeared.

ext     is the three-character DOS filename extension for
        the selected file format (for example, .PCX, .TIF, or
        .PDA).


If your original document contains more than 99 pages, the
filename is shortened even more to allow room for the
additional digit(s).


Example

If a document you named article had images on the fifth and
eighth pages, WordScan would create the following files:


article.doc


This file contains the text of the document. (The extension may
be different depending on which text format you chose.)


artic-05.tif
artic-08.tif


These contain the images on pages 5 and 8. (The extension
may be different depending on which image format you chose.)
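
The naming pattern can also be expressed as a short sketch. The
function name image_filename is hypothetical, and the handling of
pages above 99 is an assumption: the manual says only that the
name is shortened further to make room for the extra digit(s).

```python
def image_filename(document_name, page, extension):
    """Build an image filename in the abcde-nn.ext pattern.

    The part before the extension is limited to eight characters:
    the start of the document name, a hyphen, and the page number.
    """
    digits = "%02d" % page          # at least two digits, e.g. "05"
    stem_length = 8 - 1 - len(digits)
    return "%s-%s.%s" % (document_name[:stem_length], digits, extension)


print(image_filename("article", 5, "tif"))
print(image_filename("article", 8, "tif"))
print(image_filename("article", 120, "tif"))
```

The first two calls reproduce the artic-05.tif and artic-08.tif
names from the example; the third shows the assumed shortening for
a three-digit page number.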


Starting a New Document

In most cases, WordScan begins a new document each time
you choose Save from the File menu. However, if you have
disabled Delete All Intermediate Files (no X in the box),
WordScan retains the pages for further processing after the
document is saved.


If you want to begin a new document, choose New from the 
File menu. Generally, you'll want to do this whenever you want 
to discard your work-in-progress and start over again. 


You can tell if you are working on a new document by looking 
at the top of the Setup window. Here you'll see the number of 
pages processed so far in the current document. If the message 
reads 0 Pages, you are working on a new document. 


Setup Window - Untitled (0 pages)


If you haven't yet done a Save or a Save Unconverted
command on your existing work, you'll see the following
message:

[Alert box: Save Current Document? with Save, Save
Unconverted, Discard, and Cancel buttons]


You now have four choices: 


1. Save the existing document. Click Save and you'll be taken 
to the Save dialog box. 


2. Save your work-in-progress. Click Save Unconverted and 
you'll be taken to the Save Unconverted dialog box from 
which you can specify a filename to store the document 
until you are ready to come back and finish processing it 
later. (For complete information on saving and restoring 
your work-in-progress, see page 87.) 


3. Proceed without saving the pages. Click Discard. 


4. Return to your work without saving. Click Cancel. 


When you make changes to WordScan’s settings, you 
temporarily change the way the program works. Next time you 
launch WordScan, however, the settings revert to the way they 
were before you made your changes unless you save them. 


To save your settings, choose Save Settings from the File 
menu. You now have two choices for saving settings: 


1. Save your current settings as the default. Click DEFAULT in
the Settings list box, then click Save. From now on, every
time you launch WordScan, these settings will be used unless
you change the settings manually or retrieve a different
Settings file.

[Save Settings As dialog box: Save Settings Name (DEFAULT)
and Settings list (DEFAULT)]

2. Save your current settings as a specific Settings file. Type a
name of up to eight characters that relates to the use of the
settings, then click Save. For example, LEGAL would be a
good name for a Settings file tailored for reading legal
documents.


Important: You should save your settings as DEFAULT
after you have selected the desired text and image formats
under Text Settings and Scanner Settings in the Options
menu. Thereafter you can save your documents without the
extra step of selecting these items each time you save.


Retrieving Settings


When you save your settings, you are taking a “snapshot” of all
the settings you can make in WordScan, including:


All Setup Window Settings:
• Reading for
• Page Orientation
• Type Quality
• Image Source
• Brightness

All Scanner Settings:
• Image Type
• Contrast
• Scan Resolution
• Image Format
• Page Size

Open User-Defined Dictionary

Template

Preview Settings:
• Preserve Template
• Grid Marks
• Automatic Scan

Defer Processing

All Text Settings:
• Mark Suspicious Characters
• Mark Suspicious Words
• Text Format


You'll want to use Retrieve Settings for three different reasons: 


1. To easily change all your settings to those previously saved. 


2. To restore the default settings when the changed settings 
are undesirable. 


3. To delete unused Settings files. 


Choose Retrieve Settings from the File Menu. The following 
dialog box appears: 


[Retrieve Settings dialog box: Retrieve Settings Name
(NEWSLETT) and Settings list]

Click the name of the Settings file you want, then click Open to
retrieve it or Delete to delete it. Delete is dimmed when you
select DEFAULT, because you cannot delete the default
Settings file.


Once you have retrieved the desired settings file, you can 
proceed with your work as usual. The new settings take effect 
instantly. 


Important: Retrieving a Settings file that opens a
user-defined dictionary has no effect on pages that have already
been processed. The dictionary is used during recognition
to increase accuracy and reduce the number of marked but
correctly-spelled words.


Chapter 5


The Preview Window

The Preview window lets you look at each page prior to
performing recognition, so you can easily decide whether to
include or exclude portions of pages. This shortens the time it
takes to process the page and the time you'll have to spend
getting rid of extraneous data later.


You can use the Preview window to:

✓ Select only a specific area of a page to read.

✓ Scan a page to check image quality.

✓ Capture an area of text, image, or numbers only without
processing and editing the entire page.

✓ Create templates to process stacks of pages.


Using the Preview Window

To use the Preview window, click Preview at the bottom of the
Setup window. WordScan then scans the first page, opens the
Preview window, and displays the page:

[Figure: Setup window (Reading for: Text Only; Orientation:
Portrait; Type Quality: Normal; Image Source: Scanner;
Brightness: 50%; Start Processing button) and the Preview
window with its Tool palette]


The Preview window’s tool palette—the column of icons down 
the left side of the Preview window—is used for defining and 
working with zones; vertical and horizontal scroll bars help you 
adjust the position of the page in the window. 


Defining a Zone 


A zone is an area of the page that you want WordScan to 
process. You can define a zone to capture text, image, text and 
image, or numbers only. 


To define a zone, first select the appropriate tool from the Tool
palette.


Text tool       Use to define a zone around text you want
                recognized.

Image tool      Use to define a zone around an image you want
                captured.

Text&Img tool   Use to define a zone around an area you want
                recognized as text and captured as an image.

Numeric tool    Use to define a zone around an area you want to
                recognize for numbers only.


Numbers

Numbers are a special kind of text zone that
recognizes only the digits 0 through 9 and the
symbols comma, period, +, -, (, ), $, #, and /.

By defining numeric zones around areas that
contain only these characters, numbers will be
recognized more accurately. For example, a 1
will never be mistaken for the letters I or l, and
a 0 will never be mistaken for an O.
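
As a rough way to judge whether an area is a good candidate for a
numeric zone, you can check its text against the character set
listed above. The following is a sketch only; the names
NUMERIC_ZONE_CHARS and fits_numeric_zone are hypothetical, and
treating spaces as allowed is an assumption not stated in the
manual.

```python
# Digits 0 through 9 plus the symbols listed for numeric zones.
NUMERIC_ZONE_CHARS = set("0123456789,.+-()$#/")


def fits_numeric_zone(text):
    """Return True if text contains only numeric-zone characters.

    Whitespace between characters is ignored (an assumption).
    """
    return all(ch in NUMERIC_ZONE_CHARS or ch.isspace() for ch in text)


print(fits_numeric_zone("$1,250.00"))
print(fits_numeric_zone("Total: 10"))
```

Text such as a column of prices passes the check, while a label
like "Total:" does not and belongs in an ordinary text zone.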


After you select a tool, point to where you want the zone to 
start, then click and drag the mouse to draw a rectangle around 
the desired text or image. Release the mouse button to 
complete the operation. (If you make a mistake and want to 
start over, press the <Delete> key to delete the zone you just 
defined, then draw a new zone.) 


Selecting a Zone

Use the Selection tool to select a zone. Once a zone is
selected, you can resize it, move it, or delete it.


Resizing a Zone

To change the size of a zone, first select it, then drag one of the
eight handles. Use the four side handles to resize the zone
horizontally or vertically. Use the four corner handles to resize
the zone diagonally.


Moving a Zone

To move a zone, place the mouse pointer anywhere inside it.
The pointer changes from an arrow into a cross hair. Now click
and drag the mouse to move the zone. Release the mouse
button to complete the operation.


Deleting a Zone

To delete a zone, first select it, then press <Del> or <Delete>.


Processing Pages

In the Preview window, you can either process one page at a
time, or you can process an entire stack of pages with the same
zone.


Click Process One when you are ready to read a single page
with the zone you defined. The Progress Monitor will appear
and indicate the progress of the zone.


[Progress Monitor: "This image zone is now being processed.
page 1: 67% done", with a Cancel Processing button]


Note: If you do not define a zone, WordScan will ask you
to confirm that you want to process the entire page according
to the Reading for setting in the Setup window (Text
Only, Image Only, or Text & Image).


When processing completes, WordScan erases the zone, scans 
the next page, and displays it in the Preview window, ready for 
you to define a new zone. 


Note: If you want to save the zone you defined for future
use, see “Using Templates” on page 88.


Click Process Stack to process an entire stack of pages with
the zone you defined. WordScan scans and processes pages
continuously without clearing the zone between pages.


Rescanning

If the page was scanned at an angle, you may want to rescan it.
Put the page back into the scanner, then click ReScan.
WordScan scans the page again and then redisplays it in the
Preview window.


Adjusting the Preview Window

Adjust both the Preview window itself and the image in it to get
a better view of your pages.


Sizing the Window

You can adjust the size and position of the Preview window by
using any of the following methods:

[Figure: Preview window with callouts for the Title bar,
Maximize boxes, Control menu icon, and Window borders]


Click the Maximize box

This increases the size of the Preview window to the size of the
WordScan window. (If you want the Preview window to fill the
screen, first click the Maximize box on the WordScan window,
and then click the Maximize box on the Preview window.)


Double-click the Title bar

This alternately maximizes and restores the window to its
previous size.


Drag the window borders

Point to a border and click, and then drag the mouse to change
the size of the window. The corner edges let you change both
the horizontal and vertical size at the same time.


Drag the window

Point to the Title bar, then click and drag. When you have the
window positioned as desired, release the mouse button.


Use the Control menu

Open the Control menu (the one with the space bar icon in the
upper left corner of the Preview window) and use the
Maximize, Restore, Size, and Move choices to change the size
and position of the Preview window. Refer to your Microsoft
Windows User’s Guide for more information on the Control
menu.


Adjusting the Image

When you open the Preview window, WordScan adds the
Zoom menu to the menu bar. The Zoom menu lets you change
the size of the page image displayed in the Preview window.

[Zoom menu: Fit To Window, Quarter Scale, Half Scale, Full
Scale]


Fit to Window

This choice automatically scales the page image to the size of
the Preview window, so you will always see the entire page,
regardless of the Preview window's size.


Quarter Scale

This choice displays the page image at one quarter (25%) of the
Preview Resolution.


Half Scale

This choice displays the page image at one half (50%) of the
Preview Resolution.


Full Scale

This choice displays the page image at the specified Preview
Resolution.


Changing Preview Settings

Open the Options menu and choose Preview Settings to
change the way the Preview window operates. You'll see the
following dialog box:

[Preview Settings dialog box: Preserve Template, Grid Marks,
Automatic Scan, Preview Resolution: 25 % (75 dpi)]

Enable or disable the settings by clicking their check boxes.


Preserve Template

This setting is normally disabled. Enable it if you want the zone
you create to remain on the screen after you process a single
page. You might want to do this if you are going to process
pages individually using the same zone for each page.


Note: If you are using a “named” template, your zone is
cleared from the Preview window after processing a page
when Preserve Template is disabled. The template, however,
remains on disk and can be retrieved again.


For complete information about using templates, see page 88.


Grid Marks

Enable Grid Marks to see a grid of “+” symbols in the Preview
window at one inch intervals. Grid marks can help you
precisely align a zone.

Disable Grid Marks to remove the grid from the Preview
window.

Note: Grid spacing is one real inch, not one inch on your
screen.


Automatic Scan

Enable Automatic Scan to scan the next page of your
document automatically after the current page is processed.

Disable Automatic Scan if you want to manually scan each
page. When you do so, the ReScan button is replaced by a
Scan button which you must use to scan each page.


Preview Resolution

Choose the desired Preview Resolution from this drop-down
list box. (This setting does not affect the image resolution used
for actual recognition.) The choices in the list are derived from
the selected Scan Resolution that you specify in the Scanner
Settings dialog box. For example, if the Scan Resolution is set
to 300 dpi, the Preview Resolution list box offers these
choices:

100 % (300 dpi)
50 % (150 dpi)
25 % (75 dpi)


By default, the Preview Resolution is set to 25% of the Scan
Resolution, which gives you a reasonably good image in the
shortest time. (Transferring a 300 dpi image takes longer than
transferring a 75 dpi image and results in a larger image and a
slower display speed.) If you want a more detailed image of
your page, choose one of the other preview resolutions.


Note: After you select a different Preview Resolution,
WordScan does not automatically display the page at the new
resolution. You must rescan the page to see a change in
resolution.
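
The relationship between Scan Resolution and the Preview
Resolution choices is simple arithmetic, sketched below. The
function name preview_resolution_choices is hypothetical, and the
rounding used for scan resolutions that do not divide evenly is an
assumption; only the 300 dpi case is documented.

```python
def preview_resolution_choices(scan_dpi):
    """Return the list box entries for a given Scan Resolution."""
    choices = []
    for percent in (100, 50, 25):
        # Integer division is an assumption for uneven resolutions.
        preview_dpi = scan_dpi * percent // 100
        choices.append("%d %% (%d dpi)" % (percent, preview_dpi))
    return choices


for entry in preview_resolution_choices(300):
    print(entry)
```

For a Scan Resolution of 300 dpi this prints the same three
entries that appear in the Preview Resolution list box.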


The Template Menu

When you open the Preview window, WordScan adds the
Template menu to the menu bar. The Template menu contains
choices for viewing and clearing the active template (View
Current Template and Clear Current Template), and
shows the current template’s name (if any). For more
information on using templates, see page 88.


WordScan automatically closes the Preview window when you 
are finished processing pages, either when you click Stop 
Scanning or Done, according to the type of scanner you are 
using. If you want to close the Preview window at other times, 
you can do so from the Control menu. Open the Control menu 
by clicking the space bar icon in the upper left corner of the 
Preview window, then choose Close. 


% Double-click the Control-menu icon in the upper left comer 
of the Preview window. 
The Verifier

The Verifier allows you to verify recognition and correct
mistakes. Using the Verifier is optional, but can save you time
in proofing because you can instantly jump to each uncertain
character and word.


To check and correct your document, choose Verifier from the
Options menu.


    Text Settings...
    Scanner Settings...
    Verifier...
    User Dictionary...
    Preview Settings...
    Defer Processing
    Process Deferred Docs...


Note: The Verifier choice is dimmed until WordScan has
processed at least one page.


The Verifier window appears with the first page of your 
document displayed: 


[Figure: Verifier window with the File, Edit, Search, Character,
Options, and Help menus, displaying the unformatted text of a
sample two-column page with several uncertain words.]


Important: The document you see in the Verifier has not
yet been formatted. The text appears in the same relative
position as it did in the original document (or zone). Multiple
columns of text will appear in multiple columns across
the screen. WordScan will correctly reformat the text when
it saves the finished document in your choice of text formats.




Using the Verifier

With the first page of your document displayed in the Verifier
window, you are ready to begin proofing. There are three
commands you'll use more than any others:


✓ <Ctrl> + <L> to find the next uncertain character or
  word.

✓ <Ctrl> + <N> to store any corrections you made and
  load the next page of your document.

✓ <Ctrl> + <A> to add words to a user dictionary.


There are menu commands to do these operations, but you’ll 
find that using the control-key shortcuts is the most efficient 
way to verify your document. 


Finding Uncertain Characters and Words

When WordScan processes your pages, it evaluates the letters
and words and assigns a level of certainty based on, among
other things, whether the character closely matches the shape
of a known character and whether the character forms part of a
known word. Characters and words that fail to reach a
predetermined level of "certainty" are marked as uncertain
characters and words.


To find each uncertain character and word, press
<Ctrl> + <L>. If the word is correct, press <Ctrl> + <L>
again and the Verifier removes the marker at the current
position and jumps to the next marker.
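The marking and jumping behavior described above can be pictured with a small conceptual sketch. It is not WordScan's implementation: the `Word` class, the 0.0 to 1.0 certainty scores, and the threshold value are all hypothetical, since the manual does not say how certainty is computed or what level is required.

```python
from dataclasses import dataclass

CERTAINTY_THRESHOLD = 0.80  # hypothetical value; the manual does not state one


@dataclass
class Word:
    text: str
    certainty: float       # assumed score in the range 0.0 to 1.0
    marked: bool = False


def mark_uncertain(words, threshold=CERTAINTY_THRESHOLD):
    """Flag every word whose certainty falls below the threshold."""
    for w in words:
        w.marked = w.certainty < threshold
    return words


def look_for_marker(words, cursor):
    """Emulate <Ctrl> + <L>: clear the marker at the cursor, then jump ahead.

    Returns the index of the next marked word, or None ("No more markers").
    """
    if 0 <= cursor < len(words):
        words[cursor].marked = False
    for i in range(cursor + 1, len(words)):
        if words[i].marked:
            return i
    return None
```

Starting with the cursor before the first word (index -1), repeated calls visit each marked word in order and return None once every marker on the page has been viewed.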


Correcting Mistakes

After finding each marker, you can make corrections (if
necessary) just as you would in an ordinary word processing
program.


After finding a marker, the Verifier positions the cursor at the 
marked character or at the first character in a marked word. To 
make your correction, use the arrow keys, type, and use the 
<Backspace> and <Delete> keys. (The Verifier is in insert 
mode unless you change to overtype mode by pressing the 
<Ins> or <Insert> key.) 


After you have corrected a mistake, press <Ctrl> + <L> to find
the next marker.
Viewing Images

You also can view the images you captured while you are
verifying your document with the Verifier. Each of your images
is represented by an Image indicator showing where it appears
in the sequence of your document.


[Figure: Verifier window with a callout pointing to an Image
indicator in the text of the sample document.]
To view an image, just double-click on the Image indicator. In a
moment, the Viewer window opens and displays the captured
image.


You can use the scroll bars to view portions of the image that
don't fit in the window.

Zoom Menu

The Zoom menu lets you change the size of the image
displayed in the window.

Fit to Window    Automatically scales the image to the size of
                 the window, so you will always see the entire
                 page, regardless of the window's size.

Quarter Scale    Displays the image at one quarter the scanned
                 resolution.

Half Scale       Displays the image at one half the scanned
                 resolution.

Full Scale       Displays the image at the specified scanned
                 resolution.

When you finish viewing your image, choose Done to close
the Viewer window and return to your text in the Verifier
window.

Verifying More Pages

After you have viewed (and, if necessary, corrected) the last
marked character or word on the page, pressing <Ctrl> + <L>
again causes the following message to appear in the status area
at the bottom of the Verifier window:


No more markers 


Press <Ctrl> + <N> to store any corrections you made and load 
the next page. (The Verifier works with each page separately. 
WordScan combines the separate pages when you later save 
your finished document.) 


When the next page appears, press <Ctrl> + <L> to find and
verify each marked character and word on the page.


Continue processing pages in this manner until you reach the 
end of your document. When you press <Ctrl> + <N> on the 
last page of the document, the following dialog box appears: 


Verifier 


No more pages, exit Verifier?


Click <Yes> to save the last page and close the Verifier window. 


Adding Words to a User Dictionary

While you are proofing your document, you can add words to
the open user dictionary by pressing <Ctrl> + <A> when the
cursor is positioned anywhere in the word you want to add. If a
user dictionary is not open, you must return to the Setup
window and choose User Dictionary from the Options
menu. Complete instructions can be found on page 38.
Generally, you'll want to add words that you frequently
encounter in your documents but that WordScan marks as
uncertain because they are not in the main recognition
dictionary.

Remove Duplicate Markers

Another timesaving feature that is normally enabled is Remove
Duplicate Markers (found on the Options menu). When
enabled, the Verifier skips further occurrences of any marked
word that you verify as correct. You verify that a word is correct
by using either the Look for Marker (<Ctrl> + <L>) or the Add
to Dictionary (<Ctrl> + <A>) command without making any
changes to the marked word.


Verifier Reference

The operations you've learned so far in this chapter are all you
need to know to use the Verifier for proofing and correcting
your document. The Verifier has additional features and
capabilities that you can use to edit text, if you wish. This
reference section explains all the Verifier commands, menus,
and operations.
Using the Mouse


Placing the Cursor

To quickly place the text cursor where you want to edit, point
to the desired location and click the left mouse button.


Selecting Text

To select an area of text that you want to change in some way,
such as making it bold, italic, or underlined, or deleting it from
your document, use the mouse as follows:


To select a portion of a line: 


Point to the beginning of the area you want to select. Hold 
down the left mouse button and drag to the right until you 
reach the end of the desired area, then release the mouse 
button. 


To select an entire line: 


Point to any location on the desired line. Hold down the left 
mouse button and drag slightly up or left. The entire line 
becomes highlighted. Release the mouse button. 


To select multiple lines: 


Point to any place on the first line of interest. Hold down the 
left mouse button and drag to the left, then down. The current 
line is selected when you drag to the left; by dragging down, 
you select additional whole lines. Release the mouse button. 


To select a rectangle: 


Point to the upper left corner of the area to be included in the 
rectangle. Hold down the left mouse button and drag to the 
right to define the width of the rectangle, then drag down to 
define the height of the rectangle. Release the mouse button. 


In any of the above operations, the area might extend off the 
screen. When you drag the mouse to the edge of the screen, 
the screen scrolls automatically to let you continue to expand 
your selection. 


To deselect a selected area: 
Click the mouse button anywhere in the text area. 


For more information on using the mouse, see the Microsoft 
Windows User’s Guide. 


Using the Keyboard

Besides typing text, you can use the keyboard to perform other
Verifier operations. The following table summarizes keyboard
operations.


Key                     Action

Arrow keys              Moves cursor one character in indicated direction
<Backspace>             Deletes character to left of cursor or selection
<Ctrl> + <A>            Adds the word to the open user dictionary
[illegible]             Deselects a selected area
[illegible]             Deletes word in which cursor is positioned
<Ctrl> + <L>            Finds next marked character or word
<Ctrl> + <N>            Saves current page and loads next page
[illegible]             Moves cursor one word left or right in indicated direction
[illegible]             Moves cursor to the end of page
<Delete>                Deletes character or selection
[illegible]             Moves cursor to the end of the line
<Page Down> or <PgDn>   Scrolls the page down one "window full"
<Page Up> or <PgUp>     Scrolls the page up one "window full"
<Tab>                   Moves cursor to next tab stop. Tabs are fixed at
                        every four spaces.


Important: The text displayed in the Verifier window contains
region markers that are normally hidden. Region markers exist
in pairs and must be deleted in pairs. If you attempt to delete
or backspace over a single region marker, you'll see the
following message:

    Verifier

    Cannot delete region marker.

When this happens, click OK, then go to the Options menu
and choose Show Text Regions. The region markers
appear as small gray rectangles in the text. If you want to
delete an area of text that includes region markers, you
must select an area that contains one or more pairs of
region markers.


Verifier Menus

This section describes the choices on the Verifier menus, in
order from left to right and top to bottom.


File Menu 


    Next Page            Ctrl+N
    Go to Page...        Ctrl+O
    Cut Page
    Paste Page
    Add to Dictionary    Ctrl+A
    Exit Verifier
Next Page

Shortcut: <Ctrl> + <N>

Saves the current page and opens the next page. When there
are no more pages in the document, you'll see the following
message:

    Verifier

    No more pages, exit Verifier?

Click Yes to exit from the Verifier. Click No if you want to
continue to proof your document.


Go to Page

Shortcut: <Ctrl> + <O>

Opens a specific page. You'll see the following dialog box:

    Go To Page:

Enter the number of the page you want to open, then click OK.
In a moment, the text of the specified page appears.


Cut Page

Deletes the current page. You'll see the following message:

    Cut Page 2?

If you click Yes, all pages following the cut page are
renumbered to fill the gap left by the page you cut. The deleted
page is held in a temporary holding place and can be reinserted
in your document by using the Paste Page command.
Note: The Verifier does not use the Windows Clipboard.
You cannot cut and paste between other Windows
programs. You can only cut and paste pages from one
location in the Verifier to another.


Paste Page

Inserts the cut page back into your document. You'll see the
following message:

    Verifier

    Paste page before this one?


If you click Yes, the pasted page is inserted just before the 
current page. In most cases, you’ll use this command to 
reorganize your document. For example, you could cut page 5, 
then paste it before page 15. When you paste the page, all 
remaining pages are renumbered, and you continue to view the 
current page (the page you were viewing prior to the paste 
operation). You'll notice that its page number has incremented 
because of the page you just inserted before it. 
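The renumbering that Cut Page and Paste Page cause can be modeled with an ordinary list. This is a minimal sketch, not the Verifier's actual data structure: the `PageList` class and its method names are hypothetical, and it assumes the holding place keeps a single page that is emptied after a paste.

```python
class PageList:
    """Pages are numbered from 1 by their position in the list."""

    def __init__(self, pages):
        self.pages = list(pages)
        self.holding = None          # the single cut page, if any

    def cut_page(self, number):
        """Remove page `number`; later pages move up to fill the gap."""
        self.holding = self.pages.pop(number - 1)

    def paste_before(self, number):
        """Insert the held page before page `number`.

        Returns the new number of the page you were viewing.
        """
        if self.holding is None:
            raise ValueError("no page has been cut")
        self.pages.insert(number - 1, self.holding)
        self.holding = None          # assumption: holding place is emptied
        return number + 1
```

In a 20-page document, cutting page 5 turns the original page 15 into page 14. Pasting before it puts the cut page at position 14, and the page you were viewing becomes page 15 again, which matches the incremented page number described above.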


Add to Dictionary

Shortcut: <Ctrl> + <A>

Adds the word in which the cursor is positioned to the open
user dictionary.


Exit Verifier

Use when you are finished proofing your document or
whenever you want to quit. If you've made any changes that
you haven't yet saved, the Verifier asks you whether or not you
want to save them. After you answer this question, the Verifier
closes its window.


Edit Menu 


Clear Del 
Whiteout Ctrl+W


Clear Markers 


Undo

Reverses the effect of the last Clear or Whiteout command. If
you have not performed one of these commands, Undo is
dimmed.


Clear

Deletes the selected area of text. If the cleared text lies between
region markers (visible if Show Text Regions is enabled in the
Options menu), then the text following the cleared text is
moved to the left to fill the empty space.


Whiteout

Replaces the selected area of text with blank space. Whiteout
deletes text without reformatting the remaining text. This is
useful when you want to remove extraneous characters outside
a given area of text; for example, to remove "noise" (spots, dirt,
and stains) that has been recognized as random characters.
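The difference between Clear and Whiteout is easiest to see on a single line of text. The two functions below are a simplified illustration with hypothetical names. They treat a line as a plain string and ignore region markers, which in the Verifier determine whether a Clear shifts the following text.

```python
def clear(line, start, end):
    """Delete line[start:end]; the text that follows shifts left."""
    return line[:start] + line[end:]


def whiteout(line, start, end):
    """Blank out line[start:end]; the text that follows keeps its column."""
    return line[:start] + " " * (end - start) + line[end:]


if __name__ == "__main__":
    noisy = "Total ~;. 1,295"
    print(repr(clear(noisy, 6, 10)))     # the amount moves left
    print(repr(whiteout(noisy, 6, 9)))   # the amount stays in its column
```

Whiteout preserves the length of the line, so columns of text that follow the noise stay aligned.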


Clear Markers

Removes all remaining uncertain character and word markers
from the page. The following prompt appears:

    All markers will be removed, OK?


Click Yes to remove the markers. Once you have removed 
markers, you can no longer search for uncertain characters and 
words. 


Search Menu 


    Look for Marker    Ctrl+L
    Find...
    Find Next          F3


Look for Marker

Shortcut: <Ctrl> + <L>

Moves the cursor to the next marked (uncertain) character or
word.
Important: For this feature to work, you must have
enabled Mark Suspicious Words and/or Mark Suspicious
Characters in the Text Settings dialog box (in the
Setup window's Options menu).


As you view each marked word, the Verifier removes its marker 
from the document. When you have viewed all marked words 
and characters, selecting Look For Marker again causes this 
message to appear in the status line: 


No more markers 


Find

Searches for specific characters, words, and groups of words.
The following dialog box appears:

    Find what?


Type the characters, word, or phrase you want to find, then 
click OK. The Verifier searches on the current page (only), and 
if successful, positions the cursor at the beginning of the first 
character. 


To continue this same search on subsequent pages, use Find 
Next (<F3>) on each page of your document. (The Verifier 
retains what you entered until you replace it with something 
else.) 


Find Next

Shortcut: <F3>

Locates the next occurrence of the character, word, or phrase
you entered in the Find dialog box.
Change

Finds a character, word, or phrase and changes it to something
else. When you choose Change, you'll see the following dialog
box:


Type the characters you want to search for in the upper text 
box. Click in the lower box and type the characters you want to 
replace them with, then click OK. The Verifier searches for the 
string, and if successful, displays the following dialog box: 


String found:

Change         Change this occurrence only and go to the next.
Next           Skip to the next occurrence without changing this one.
Change Rest    Change this and all subsequent occurrences on the
               page.
Stop           Stop making changes.


Note: To limit the scope of the change operation, you can
first select an area of text in which to perform the change
operation by using the mouse to highlight the desired area.
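The four choices in the "String found" dialog box amount to a loop over each occurrence on the page. The sketch below shows that logic under stated assumptions: the `change` function and its `decide` callback are hypothetical stand-ins for the dialog box, and the search is a plain case-sensitive match, which the manual does not specify.

```python
def change(text, find, replace_with, decide):
    """Walk through each occurrence of `find` and apply the user's choice.

    `decide(position)` stands in for the dialog box and must return one of
    "change", "next", "rest", or "stop".
    """
    if not find:
        return text
    out, pos = [], 0
    while True:
        hit = text.find(find, pos)
        if hit == -1:
            break
        choice = decide(hit)
        if choice == "stop":
            break
        if choice == "rest":
            # Change this and every later occurrence, then finish.
            out.append(text[pos:hit])
            out.append(text[hit:].replace(find, replace_with))
            return "".join(out)
        out.append(text[pos:hit])
        out.append(replace_with if choice == "change" else find)
        pos = hit + len(find)
    out.append(text[pos:])
    return "".join(out)
```

Limiting the scope to a selection corresponds to calling the function on only the selected slice of the page and leaving the rest of the text untouched.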


Character Menu

The Character menu has choices for changing the appearance
of characters: normal, bold, italic, underline, or any
combination of bold, italic, and underline.


    Normal       F5
    Bold         F6
    Italic       F7
    Underline    F8

If an area of text is selected when you choose one of the
attributes, that area of text will change accordingly. If no area is
selected, choosing an attribute establishes the appearance of
characters you subsequently type. Furthermore, the attributes
of text that you type depend on the attributes of neighboring
characters.

To apply attributes to characters you haven't yet typed, select
one or more of the attributes when no text is selected.


Normal

Shortcut: <F5>

Changes selected text to normal (no attributes) or cancels any
selected attributes for text you subsequently type.


Bold

Shortcut: <F6>

Changes selected text to bold or selects the bold attribute for
text you subsequently type.


Italic

Shortcut: <F7>

Italicizes selected text or text you subsequently type.


Underline

Shortcut: <F8>

Underlines selected text or text you subsequently type.


Important: The effect of these attributes on your finished
document depends on the Text Format you specify. Most
word processor formats retain the attributes; however, if
you select ASCII (for example), no attributes or other
formatting information is retained. The nature of ASCII files
does not allow for text formatting.


Options Menu

    ✓ Show Char Markers
    ✓ Show Word Markers
      Show Text Regions
    ✓ Remove Duplicate Markers
      View Image
Show Char Markers

When enabled (checked), marked characters are highlighted by
a yellow rectangle (a dark rectangle on monochrome screens),
as shown below.

When disabled (no check), marked characters are not
highlighted. This selection does not affect the Verifier's ability
to find marked characters.


Show Word Markers

When enabled (checked), marked words are highlighted by a
blue rectangle (a light rectangle on monochrome screens), as
shown below.


[Figure: Verifier window showing sample text with callouts
pointing to a Word marker and a Character marker.]

When disabled (no check), marked words are not highlighted. 
This selection does not affect the Verifier’s ability to find 
marked words. 


Show Text Regions

Text region markers are small rectangles that denote the
beginning and end of regions of text. A region is an area of text
that has a specific set of formatting characteristics, such as a
font change, the beginning of a new column or line, and other
format changes. WordScan uses text region markers when it
saves your document to reconstruct the original format. You
can only delete pairs of region markers, so you need to enable
them when deleting or whiting out areas of text.

When enabled (checked), region markers and text zone
indicators (showing where each text zone begins) are visible.

When disabled (no check), region markers and text zone
indicators are hidden.


Remove Duplicate Markers


When enabled (checked), the Verifier removes markers on 
subsequent occurrences of any marked word after the first time 
you verify it is correct. Duplicate markers are removed if you 
either: 


1. Press <Ctrl> + <L> or choose Look for Marker from the
Search menu.


2. Press <Ctrl> + <A> or choose Add to Dictionary from the 
File menu. 


Since the Verifier works with a single page at a time, it adds the
duplicate words to a list. Then, when you load each 
subsequent page of the document, the Verifier automatically 
removes the word markers from any words it finds that are in 
the list. 


Note: The Verifier's duplicate word list can hold up to 100
words at one time. It saves the most recently used words.
The list is cleared each time you exit from the Verifier.
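A list that holds a fixed number of the most recently used words behaves like a small least-recently-used cache. The sketch below is one way such a list could work, not the Verifier's actual code. The class name, the exact-match comparison, and the rule that seeing a word again counts as "use" are assumptions; only the 100-word limit comes from the manual.

```python
from collections import OrderedDict


class DuplicateWordList:
    """Remembers words verified as correct, keeping the most recently used."""

    def __init__(self, capacity=100):   # the manual gives a limit of 100 words
        self.capacity = capacity
        self._words = OrderedDict()

    def verify(self, word):
        """Record a word the user accepted without changes."""
        self._words.pop(word, None)
        self._words[word] = True
        if len(self._words) > self.capacity:
            self._words.popitem(last=False)   # drop the least recently used

    def unmark_page(self, marked_words):
        """Return the marked words that still need attention on a new page."""
        remaining = []
        for word in marked_words:
            if word in self._words:
                self._words.move_to_end(word)   # assumed: a match counts as use
            else:
                remaining.append(word)
        return remaining
```

Creating a new `DuplicateWordList` each time the Verifier starts corresponds to the list being cleared on exit.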


View Image

Shortcut: Double-click the Image Zone indicator


Choose this to display an image after placing the cursor in an 
Image Zone indicator (see page 64). The command is dimmed 
if the cursor is not on an Image Zone indicator. 


Using Special Characters

You can add special characters such as fractions, accented
letters, foreign currency symbols, and ligatures to the document
you process with the Verifier as follows:


1. Locate the character you want to use (the Microsoft Windows
User's Guide has tables that show the characters available)
and note its corresponding 3- or 4-digit code. (The
ANSI set code must be preceded by a 0.)


2. Press and hold down the <Alt> key. 


3. Using the numeric keypad on the right side of your keyboard
(not the number keys across the top of the keyboard),
enter the 3- or 4-digit code.


4. Release the <Alt> key and the special character will appear. 


Characters and Codes 


Your computer uses character sets to translate the keys on your keyboard
into the symbols you see on your screen and printouts. These character sets
vary between computer manufacturers. Programs running under Microsoft
Windows use the American National Standards Institute (ANSI) 8-bit character
set. All the possible combinations of eight bits can represent up to 256
characters; however, most keyboards let you directly type only about the
first 128 of these by using combinations of regular, shifted, and control-key
combinations. (The function keys F1-F12 are not a part of the character set.)

In addition to the characters in the ANSI character set, Windows programs 
let you type the characters in your computer’s ASCII character set that have 
ANSI character equivalents. 
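The rule that a leading 0 selects the ANSI set can be tried out with Python's built-in codecs. This is an approximation for illustration only: it assumes that the Windows ANSI set corresponds to Python's "cp1252" codec and that the computer's own set is the US code page "cp437". Neither codec name appears in the manual, and the function name is hypothetical.

```python
def alt_code(digits):
    """Return the character produced by holding <Alt> and typing `digits`.

    A leading zero selects the ANSI set (approximated by "cp1252");
    otherwise the code is looked up in the computer's own set
    (assumed here to be the US code page, "cp437").
    """
    value = int(digits)
    if not 0 < value <= 255:
        raise ValueError("code must be between 1 and 255")
    codec = "cp1252" if digits.startswith("0") else "cp437"
    return bytes([value]).decode(codec)


if __name__ == "__main__":
    for code in ("0233", "130", "0189", "171", "0163", "156"):
        print(code, alt_code(code))
```

Under these assumptions the same character can often be reached by two different codes, one with the leading 0 and one without.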


Help Menu

When you come across a capability or feature you don't know
how to use, consult the on-line help system. Choose a topic
from the Help menu or choose Index to see a list of all topics.
You'll find information on using every Verifier feature, button,
menu, and setting.
Advanced Features

WordScan has many advanced features that you'll want to use
after you've mastered basic operation. Advanced operations are
found under various menus and include:

✓ Deferred processing
✓ Processing from disk and fax files
✓ Saving and retrieving your work-in-progress
✓ Using templates


Deferred Processing

Whenever you want to scan documents as quickly as possible
and perform recognition (which is generally much more time
consuming than simple image scanning) later, use deferred
processing. Deferred processing lets you scan pages as fast as
your scanner can go. The page images are saved for
recognition processing at a later, more convenient time.


Scanning for Deferred Processing

    Text Settings...
    Scanner Settings...
    Verifier...
    User Dictionary...
    Preview Settings...
    Defer Processing
    Process Deferred Docs...


1. Choose Defer Processing from the Options menu. (You 
must start a new document before you can choose Defer 
Processing.) You'll see the following message: 


    All pages scanned will be saved as a deferred
    job for future processing.


2. Click OK to proceed. Note that the Setup window changes 
to say Start Processing (Deferred) as a reminder that 
when you click Automatic or Preview, your pages will 
only be scanned; they won’t be recognized until later. 


    Start Processing (Deferred)
    [Automatic]  [Preview]


3. Click either Automatic or Preview. Processing occurs just
as in normal (non-deferred) mode, except that recognition
doesn't happen at this time.

When you're finished scanning, you can do either Step 4 or
Step 5, below.


    New
    Open...
    Save...
    Save Unconverted...
    Retrieve Settings...
    Save Settings...
    Retrieve Template...
    Save Template...


4. If you have scanned all the pages of your document,
choose Save from the File menu. You'll see the following
dialog box:


[Dialog box: Save As Document..., with a Document Name text box,
the current Directory (c:\), a Directories list, a Text format
list (Microsoft Word 5.0), an Image format list (PageMaker
(TIFF)), and two check boxes: Save Document for Proofing and
Save Markers As.]


The only difference between this dialog box and the normal
Save dialog box is the check box labelled Save Document
for Proofing. If you enable this option (X in the
box), WordScan saves the document in its internal format
for later proofing.


Enter a document name and click Save. You'll see the
following alert box:

    WordScan

    Scanning for chapter1 is complete and deferred.

    Use the Process Deferred Docs menu item
    when you are ready to process this document.

    This document will NOT be converted to
    Microsoft Word 5.0 format. After processing is
    complete, it will be saved for proofing within
    WordScan.

    * When processing is complete, use Open... to
      retrieve the processed document.

    * After proofing is complete, use Save... to
      convert to text format.


This message explains that, because you are saving for
proofing, the text and scanner settings that you made will
be ignored and you'll have to choose Save again after
proofing to complete your finished document.


When disabled (no X in the box), WordScan automatically
saves your document after it finishes recognition, using the
name, text format, and image format you specify here,
without giving you the opportunity to proof. Use this option
when you don't want to proof your document before
saving it.


5. If you want to stop now and add more pages to your
deferred document later, choose Save Unconverted from
the File menu. The following dialog box appears:


[Dialog box: Save Unconverted..., with a Save As text box, an
Unconverted list, and a Cancel button.]


Type a name for your unconverted document and click 
Save. Later, you can choose Open from the File menu to 
continue adding pages to your deferred document. 


Important: You cannot process your deferred document
until you save it by performing a Save operation. If you
choose Save Unconverted, you must later open the document
by choosing Open and then save the document by
choosing Save before the document will appear in the list
of deferred documents.
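The rule in the note above can be summarized as a small state model: a document appears in the deferred list only after a Save, never after Save Unconverted alone. The sketch below uses hypothetical class, method, and state names and is meant only to make that sequence concrete; it does not describe how WordScan stores its files.

```python
class DeferredDocument:
    """Tracks where a deferred document is in the scan, save, process cycle."""

    def __init__(self, name):
        self.name = name
        self.pages = []            # scanned page images awaiting recognition
        self.state = "scanning"    # hypothetical state names

    def scan(self, image):
        """Add a page; models reopening an unconverted document if needed."""
        if self.state == "queued":
            raise RuntimeError("document already saved for processing")
        self.pages.append(image)
        self.state = "scanning"

    def save_unconverted(self):
        """Set the work aside; more pages can be added after reopening."""
        self.state = "unconverted"

    def save(self):
        """Finish scanning; only now does the document join the deferred list."""
        self.state = "queued"


def deferred_list(documents):
    """Names shown by Process Deferred Docs: saved documents only."""
    return [d.name for d in documents if d.state == "queued"]
```

A document that was only saved unconverted is missing from `deferred_list` until it has been reopened and saved.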


Processing Deferred Documents

Follow these steps to process (perform recognition on) any
saved deferred jobs:


1. Choose Process Deferred Docs from the Options menu. 
The following dialog box appears: 


[Dialog box: Process Deferred Docs..., with a Deferred Documents
list (CHAPTER1) and Process and Delete buttons.]


2. Select the document or documents you want to process. To 
select multiple documents, just click on each one so that it 
is highlighted. If you accidentally select one you don’t want 
to process, click on it again to unselect it. 


3. Click Process. WordScan processes the documents while 
displaying a window similar to the following: 


[Figure: Calera WordScan window with a Progress Monitor reading
"page 1: 36 % done" and a Cancel Processing button.]
When processing completes, one of the following occurs:

a. If you enabled Save Document for Proofing (X in the
   box) when you saved the deferred job, WordScan saves
   the document in its internal format using the name you
   specified.

   Important: When deferred processing completes, you
   must choose Open from the File menu when you are ready
   to proof the document. When you finish proofing the
   document, choose Save from the File menu to specify a
   document name, text format, image format, and other save
   options.

b. If you disabled Save Document for Proofing (no X in
   the box), WordScan automatically completes processing
   and then saves your finished document according
   to the specifications you chose just after you scanned
   the pages for the deferred job. For example, if you
   entered the name chapter1 and selected a text format
   of WordPerfect and an image format of PageMaker
   (TIFF), the text and images of the document are saved
   in these formats as soon as processing is complete. In
   this case, you do not have the opportunity to proof
   your document.


Processing Disk and Fax Files

You can use WordScan to recognize the text and/or capture the
images in disk or fax files. (Disk files are files that you have
created using another program. Fax files are files from a PC fax
card that receives faxes and stores them as image files on your
hard disk.) The process is almost identical to processing images
from a scanner, except that you must specify the file(s) to
process, as follows:


1. Select Disk or Fax in the Setup window. 


2. Click Automatic or Preview, as desired. You'll see the 
dialog box on the following page. 
[Dialog box: Scan From Fax Files, Select Image Files to Process,
with a Files list (cart_01.pcx), a Directories list, the current
Directory (c:\drawings), an Image File Types list, and a
Selected Image Files list (c:\drawings\cart_01.pcx,
c:\drawings\cart_02.pcx).]


3. Select the type of image file you want to process from the 
Image File Type drop-down list box. You can select from 
among the following types: 


File Types:

*.TIF    Tagged Image File Format (TIFF), used by some
         paint programs and supported by most desktop
         publishing programs.

*.PCX    ZSoft PC Paintbrush format, used by many fax
         cards, paint programs, and desktop publishing
         programs.

*.PDA    Processed Document Architecture, Calera's own
         proprietary text and image file format. PDA files
         can be saved by WordScan and by all other Calera
         recognition products, and can subsequently be
         opened by using this selection.

*.DCX    A file format used by some fax card software that
         is similar to .PCX but allows multiple images per
         file.

*.*      Shows you all file types in the selected directory.
         Use this option when your file names do not
         follow standard naming conventions, or are not
         one of the above types. (WordScan can also
         process fax files from AT&T FaxConnection,
         Gammalink Gammafax, Spectrafax the FAXCARD,
         Datacopy Microfax, the Intel Connection
         Co-Processor, and the Panasonic Fax Partner, as
         well as any other 200 x 200 dpi fax image (fine
         mode) in TIFF, PCX, or DCX format.)


Note: You can select only one file type at a time. If you
need to process multiple image files of different types, you
must do so in groups; for example, processing all .PCX files
at one time, then processing all .TIF files the next time. If
you choose “*.*” you must be careful not to select files of
different types. If you do, WordScan will display an error
message when you start processing the files and you'll have to
select the files to process again. You can specify up to 499
files of one type at a time.
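

If you prepare file lists outside WordScan, the grouping rule in
the note above can be sketched in a few lines of Python. This is
only an illustration: the function name batch_by_type and the
idea of building batches in a script are assumptions, not part of
WordScan. Only the one-type-per-batch rule and the 499-file limit
come from this manual.

```python
from collections import defaultdict
from pathlib import PureWindowsPath

MAX_FILES_PER_BATCH = 499  # limit stated in the note above


def batch_by_type(paths, limit=MAX_FILES_PER_BATCH):
    """Group image paths by extension, then split each group into
    batches of at most `limit` files (hypothetical helper)."""
    groups = defaultdict(list)
    for path in paths:
        # Compare extensions case-insensitively: .pcx and .PCX match.
        extension = PureWindowsPath(path).suffix.upper()
        groups[extension].append(path)

    batches = []
    for extension in sorted(groups):
        files = groups[extension]
        for start in range(0, len(files), limit):
            batches.append((extension, files[start:start + limit]))
    return batches
```

Each batch returned holds files of a single type, so the files in
one batch can be selected together without triggering the
mixed-type error described above.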


4. Use the file list box to select the drive and directory you 
want to use. Note that directory names are enclosed in 
square brackets ([calera]). Double-click on a directory 
name to open that directory. To go up a level to the parent 
of the current directory, double-click on the [..] symbol. 


5. After selecting the desired directory, find the file you want
to process, then click the filename to highlight it. Click
Select to put the file into the Selected Image Files list box.
If you want to process multiple image files, repeat this step
to select additional files. Or, you can highlight multiple
filenames by clicking once on each desired file. If you
inadvertently highlight the wrong file, click on its name again
to unselect it.


Shortcut: Double-click each filename to select it.


If you accidentally select an image file that you don’t want
to process, click on it in the Selected Image Files list box,
then click Remove.


To begin processing the file or group of files you selected, click 
OK. Recognition will proceed normally until all selected files 
have been processed.


Saving Your Work-In-Progress


When you want to save your work but you haven't finished scanning,
recognizing, or proofing pages, choose Save Unconverted from the
File menu. You'll see the following dialog box:


[Dialog box: the File menu with Save Unconverted chosen, and a
"Save As:" name field with a Cancel button.]

After you choose Save Unconverted, you can quit WordScan or 
choose New to begin work on a different document. Your 
document's pages are saved in WordScan’s internal format so that 
later you can reopen them and continue processing. To retrieve 
your document, choose Open from the File menu, then select the 
desired document name. You can then proceed to add more 
pages to the document or use the Verifier to proof your document. 


Retrieving Your Work-In-Progress


To retrieve your work after you have used Save Unconverted
or after you have processed deferred pages with Save
Document for Proofing enabled (an explanation of proofing
begins on page 62), choose Open from the File menu. You'll
see the following dialog box:


[Dialog box: Open Pages, with an "Open Pages Name:" field and a
Pages list (NEWSLETT, TESTPAGE).]


Choose the document you want to open by clicking its name, 
then click Open. 


Using Templates 


Defining a Template


When you need to process multiple pages having a similar
format, the easiest way to do so is by using templates. To
define a template, follow these steps:


1. Scan a sample page of the type that you will use with the
template.


2. Define a zone as described in Chapter 5, “The Preview
Window.” When you finish, you have defined a template,
which now appears in the Template menu under the name
Untitled.


Saving a Template


1. Choose Save Template from the File menu. The following
dialog box appears:


[Dialog box: Save Template As, with a "Save Template Name:" field
and a Templates list (LEGAL, NEWSLETT).]

2. Type a name of up to eight characters for your template, 
then click Save. 


WordScan saves your template for use with future recognition 
jobs.


Saving Templates with Settings


Instead of saving your template by itself, you can save your
template as part of a Settings file. Then, each time you retrieve
that Settings file, WordScan will also retrieve your template. To
save your template with your settings, follow these steps:


1. Choose Save Settings from the File menu to save your
settings.


[Dialog box: Save Settings As, with a "Save Settings Name:" field
and a Settings list.]


2. Type a name for your Settings file, or choose one of the
existing names, if desired, by clicking its name. (You probably
don’t want to choose DEFAULT, because WordScan will
then load your template every time you launch it.)


3. Click Save to save all current settings. (For an explanation 
of all the settings that WordScan saves, see page 48.) You'll 
see the following message: 


[Message box: WordScan. "Do you want to save template as part of
<<NEWSLETT>>?"]


4. Click Yes to save the template with the Settings file.
WordScan gives the template the name that you specified for the
Settings file, so you can later retrieve just the Template
without all the other settings, if desired.


Retrieving Templates


You can retrieve any saved templates by choosing Retrieve
Template from the File menu. When you do so, you'll see the
following dialog box:


[Dialog box: Retrieve Template, with a "Retrieve Template Name:"
field and a Templates list.]


Select the template you want to retrieve by clicking its name. If 
you don’t remember which template is which, you may be able 
to tell by examining the zone in the template. To do this, click 
the View button in the Retrieve Template dialog box. A 
miniature view of the template’s zone appears, as shown here. 


[Figure: View Template dialog box showing "Zone in: TEXTBOOK,"
with callouts for the landscape letter outline, the landscape
legal outline, and the zone outline. The Template menu is also
shown, with the View Current Template and Clear Current Template
commands.]


When the template is retrieved, its name appears at the bottom
of the Template menu and its zone appears in the Preview
window (if the Preview window is opened).


Using Templates


If the Preview window is open and you want to use the 
template in Automatic mode, close the Preview. Otherwise, you 
can remain in Preview mode and use the template for 
processing while previewing as described in Chapter 5, “The 
Preview Window.” 


Automatic Processing with Templates


To use the template you selected in Automatic mode, follow
these steps:


1. Select the Templates icon under Reading for.


2. Place your original page(s) in the scanner and click
Automatic. WordScan processes each page with the template
you defined.


Note: If you want to know which template is active at any
given time, look in the Template menu that appears when
you select Reading for Templates.


Deleting Templates


When you no longer need a template, you can delete it from
your hard disk by following these steps:


1. Choose Retrieve Template from the File menu. The
Retrieve Template dialog box appears:


[Dialog box: Retrieve Template, with a "Retrieve Template Name:"
field and a Templates list.]


2. Select the template you want to delete by clicking its name.


3. Click Delete. The following alert box appears:


[Alert box: "Do you really want to delete template
<<TEXTBOOK>>?" with OK and Cancel buttons.]


4. Click OK to delete the template.


Chapter 8
Getting the Most from WordScan


Checking 
Recognition 


Not all documents are created equal. Some documents have
more complex layouts and type styles than others. In general,
WordScan recognizes simple documents with the least
difficulty, but it can do a remarkable job recognizing
most poor-quality documents if you know how to get the best
recognition.


This chapter explains the factors that affect the way WordScan 
recognizes documents and what you can do to get the best 
recognition. You'll find information on: 


- Conditions that produce the best scanning results.


- How to improve your results, based on the clues you see
  in the Verifier and in your finished document.


- How your scanner and WordScan read the layout of your
  document and how that might affect the accuracy of the
  job.


- How to evaluate documents before you scan them, so
  that you can anticipate and minimize potential problems.


No two scanners are alike. Even two scanners from the same 
manufacturer may have different ideas of what “normal” 
brightness is, and may scan the same document with different 
levels of success. 


To determine the quality of the scan, set Preview Resolution 
(in Preview Settings under the Options menu) to the highest 
setting. Then scan a page into the Preview window and look at 
the image. If characters look “light” and broken, you may want 
to darken the Brightness setting and scan again. (Remember 
that the Brightness setting only affects future scanning.) If the 
characters are “dark” and joined or touching, you may want to 
lighten the Brightness setting. 


You will sometimes encounter limitations that you can do 
nothing about, such as a document that has been photocopied 
too many times, or a scanner that is blind to blue and ignores 
blue text on a magazine page. Most often, however, you can
find ways to take advantage of your scanner’s particular
capabilities and minimize or fix the recognition mistakes
WordScan makes. The rest of this chapter gives examples of
ways scanners may read different kinds of documents, and
offers suggestions about how to make your scanner and
WordScan work best for you.


Refer to Installation Notes for Scanners and Fax Cards for
specific information about the capabilities and limitations of
various scanners.


A Few Rules


In a perfect world, every document you want to recognize
would be an original in black ink on clean white paper. It 
would be free of wrinkles, coffee stains, and margin notes. 


Since the information that you need to recognize probably 
comes from a variety of sources (newspapers and journals, 
company reports, lists, charts and diagrams, and files from 
computer fax cards), you are unlikely to achieve this ideal. 
However, you can approach it by following a few simple rules. 
As you gain experience with your scanner and with the 
particular demands of the documents you need to recognize, 
you can make up your own rules. Here are a few standard rules 
to get you started: 


- Make sure your scanner is clean, especially the surface
  from which it scans the document.


- Make sure you insert the document correctly and
  straight, according to the rules for your scanner. If the
  text on the page is slanted, or the page gets twisted while
  it is being scanned, WordScan may have trouble making
  sense of it, even if you have otherwise inserted the page
  correctly. This may result in nonsense on the page, missing
  characters, or split lines of text. (If you are scanning
  several pages from the same document and the recognition
  on one page is significantly worse than the others,
  there is a good chance that page went into the scanner at
  a slant.)


- Choose the correct orientation according to the way
  your document appears in the Progress Monitor or
  Preview window. The document must always read from
  top to bottom for proper recognition.


- Change the Brightness setting to compensate for text
  that is very light or very dark. Remember that the settings
  work like those on a photocopier. For example, to
  compensate for a dark page, choose a setting more toward
  Lighten.


- Change the Type Quality setting when reading
  draft-quality dot-matrix or a page that is entirely monospaced
  with characters that touch (typical of 10-pitch typewriters
  set to 12-pitch spacing).


- Use a zone to exclude “noise” when reading pages
  that contain handwritten notes, stains, and other
  undesirable items.


Signs That Recognition Can Be Improved


You can tell if WordScan is having trouble with a document
even before you look at the finished product. If recognition
seems to be taking too long, then it might be worth clicking
Cancel Processing on the Progress Monitor, checking the
settings and page orientation, and starting the scan again. Of
course, “too long” is a relative term. You will soon get a sense
of how long it takes to process the kinds of documents you
usually scan. The times generally vary from fifteen seconds to
three or four minutes, depending on the version of WordScan,
the speed of your computer, and the quality of the original
document.


Problems that may cause an unusually long processing time
include:


- Improper page orientation.


- Misaligned page causing the lines of text to be slanted
  with respect to horizontal.


- Background “noise” in the original document such as
  stains, stray marks, or handwritten notes. If you can’t get
  a clean copy, try changing the Brightness setting.


- Badly degraded page or page with poor contrast
  between the text and the paper.


If, upon examining your finished document, you find there are 
many errors, refer to the following sections for information on 
how to improve recognition. 


Correcting Text Recognition Problems


If WordScan does not recognize a word, that word will usually
be “marked.” The effect of marking depends on the Text
Settings (Mark Suspicious Characters and Mark Suspicious
Words) you chose prior to processing the document.


Characters and words are marked as “suspicious” when 
WordScan is uncertain of their accuracy. WordScan performs 
many different kinds of checks for accuracy and combines 
these checks into a total “certainty score.” One of these checks 
is to see if the recognized word appears in the built-in and 
user-defined dictionaries. If WordScan finds a match, the word 
generally is not marked. However, if you are scanning text with 
an unusual vocabulary, you'll find many marked words that 
actually were recognized correctly. See page 71 for information 
on adding words to user-defined dictionaries to improve 
recognition accuracy and reduce the number of marked words. 
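

The idea of combining checks into a certainty score can be
pictured with a small Python sketch. It is not WordScan's actual
method, which this manual does not describe: the per-character
scores, the 0.2 dictionary bonus, the 0.8 threshold, and the "~"
marker are all assumptions chosen for illustration.

```python
def certainty(word, char_scores, dictionaries):
    """Combine per-character confidence with a dictionary check.

    char_scores holds one hypothetical confidence value (0.0 to 1.0)
    per character; dictionaries is a list of sets of known words.
    """
    if not char_scores:
        return 0.0
    shape_score = min(char_scores)  # the weakest character limits the word
    in_dictionary = any(word.lower() in d for d in dictionaries)
    bonus = 0.2 if in_dictionary else 0.0  # assumed weight, illustration only
    return min(1.0, shape_score + bonus)


def mark_suspicious(words, dictionaries, threshold=0.8, marker="~"):
    """Prefix each word whose certainty falls below the threshold."""
    marked = []
    for word, char_scores in words:
        if certainty(word, char_scores, dictionaries) < threshold:
            marked.append(marker + word)
        else:
            marked.append(word)
    return marked
```

In this sketch, a correctly read but unusual word is marked until
it is added to a user-defined dictionary, which mirrors the
behavior described above.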


If you see many marked characters and words that are actually 
misrecognized, WordScan is alerting you to a scanning 
difficulty, usually one of those explained in this chapter.


Joined Characters


“Joined characters” are two or more characters recognized as
one, such as when an m appears in place of rn or when h
appears in place of li. Joined characters are often a problem
when you are scanning a document with a colored background
or a document that has been photocopied many times. Color
and photocopying reduce the contrast between the text and the
page background. Also, when text has letters that are printed
close together or condensed, WordScan may have trouble
recognizing the characters correctly.


Some laser printers produce tightly spaced characters that lead
to joins. This is particularly true of Times Roman.


If you have character and word marking enabled, WordScan
will usually mark as suspicious those words in which characters
have been joined. If you find many of these in your document,
set the Brightness to lighten the page and make WordScan
more sensitive to the space between the letters.


Split Characters


“Split characters” occur when one character is read as two or
more, such as iii for m or li for h. If you see many of these split
characters (this very often looks like lots of i's on the page), use
the Brightness setting to darken the page and make WordScan
less sensitive to the breaks in the letters.


If the document is typed or draft-quality dot-matrix (where you
can see individual dots), you can often solve this problem by
setting the Type Quality to Dot Matrix (but use this setting
only if the text is monospaced, not proportionally spaced).
Also, photocopying light documents at a darker setting
sometimes helps.


Misplaced Capitals


WordScan may have trouble deciding where to place a very
large initial capital letter that spans several lines. These
“dropped capitals” are often used by magazines and books to
start a new article or chapter.


More often, WordScan places the letter on the lowest line to
which it reaches. If you have the character and word markers
enabled, the first word on the line where the letter appears may
be marked. By using the Verifier, you can easily move these
misplaced capitals.
[Sample page: a magazine column that opens with a dropped
capital "A"]

Ah, showbiz. The hoofers are dancing their hearts out and smiling
their biggest Broadway smiles. It's hard to see the smiles,
though, since the ensemble is wearing the white head-to-toe
outfits common to high-tech “clean rooms,” where computer chips
are made. But this is not Broadway. As the dance number
(“Bunny-Suit Rock”) ends, Intel Corp. cofounder Andrew Grove
steps out of costume and gives his employees a pep talk.

Such antics may seem odd for the company that builds the
microprocessors found in every IBM PC. But that’s why the
business of show business for business is booming: even the
unlikeliest companies are injecting a bit of flash and dazzle
whenever they roll out a new product, take part in a trade show
or just want to have a good time. To celebrate its 20th birthday,
Intel hired the “industrial theater” expertise of Ray Bloch
productions in New York. When the five-city tour ended last
weekend, it had played to about 10,000 company employees and
their families. Eight months in the making, “20 Years Strong” was
“almost as close as you get to a real Broadway show,” says
producer Ken Robbins. Well, maybe. At least they cost about as
much, from $100,000 to several million for the big Detroit auto
extravaganzas. Many of these meetings utilize dazzling


If the letter is very large, larger than WordScan can recognize
(28 points at 300 dpi), WordScan may remove it from the text
file. In that case, use the Verifier to replace it. Since dropped
capitals occur very infrequently in most publications, this
problem should not significantly impact your productivity.


Missing Text


Sometimes you might find that some portion of your document
has been dropped altogether. This may happen if the page is
skewed as it feeds into the scanner. Try feeding it again.
WordScan can tolerate a misalignment of up to about 2” before
recognition is affected. And even though a slightly skewed
document will produce satisfactory recognition accuracy,
recognition speed is best when the document is straight.


Also, some scanners are blind to some colors. If your text or
image is in a color that is invisible to your scanner, you will not
be able to capture it. Sometimes photocopying the document
before you scan it will help; however, many photocopy
machines are also blind to certain colors. Typical colors to
watch out for are blue, yellow, and most pastel colors.


In the same manner as color blindness, some scanners see
certain colors as black, no matter how light the color appears to
be on the original. This often causes a problem when the
document has been highlighted with a translucent blue or
yellow pen. In this case, text will be missing because the
scanner sees solid black where shapes of characters should be.
Again, photocopying the document first may solve this problem.


Text also may be lost when handwritten notes, signatures, and
other “noise” are touching or are very close to characters and
interfere with recognition.


The accompanying Installation Notes for Scanners and Fax
Cards provides information about the colors not picked up by
each of the supported scanners.


Broken Lines and Nonsense


If your processed document shows several different kinds of
errors, the original probably went into the scanner at an angle.
If an entire page is reproduced as nonsense, then the document
was probably scanned in the wrong orientation. Check the
page image in the Progress Monitor or in the Preview window.
The page should always read from top to bottom with respect
to the screen.


If the lines of text on the original are not straight, try 
photocopying it to line it up properly on the page. If large 
portions of the page were recognized accurately, but there are 
far too many i’s, then the page is probably too light. You may 
also get garbled text if you try to read draft-quality dot-matrix 
pages without setting the Type Quality to Dot Matrix. 
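

The "too many i's" clue lends itself to a rough automated check
on saved ASCII output. The Python sketch below is only a
heuristic under stated assumptions: the set of thin characters
and the 0.25 threshold are guesses for illustration, and neither
function is part of WordScan.

```python
THIN_CHARS = set("il1|!")  # assumed set of thin-stroke look-alikes


def thin_fraction(text):
    """Return the share of non-space characters that are thin strokes."""
    glyphs = [c for c in text if not c.isspace()]
    if not glyphs:
        return 0.0
    thin = sum(1 for c in glyphs if c in THIN_CHARS)
    return thin / len(glyphs)


def brightness_hint(text, threshold=0.25):
    """Suggest darkening when thin strokes dominate the recognized text.

    The threshold is an assumption; ordinary English text contains a
    much smaller share of these characters.
    """
    return "darken" if thin_fraction(text) > threshold else "ok"
```

For example, a line recognized as "tlie coiiipany liad iiiany
iiiore" yields the hint "darken", while the correctly recognized
"the company had many more" yields "ok".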


Note: On some magazine pages, the spacing between
words is so small that the spacing in the scanned document
may be incorrect. Generally, the recognition will be good,
but you might need to fix the spacing manually in the
Verifier or your application program.


Handwritten Notations


WordScan treats signatures and other handwriting as images
and usually will drop them when you're reading for text only.
Some margin notes may leave a trail of “noise” in the scanned
document, because WordScan tries to read them as text. It is
best to use a “non-photo blue” pencil or pen to make notes on
documents you plan to scan, if at all possible.


[Sample page: a business letter with a handwritten signature and
handwritten marks in the margin]

Publisher
Road and Track
Box #1757
Newport Beach, CA 92663

Dear Publisher:

I would like to ask permission to reprint the article entitled
"How to Drive," by Peter Egan, from the April, 1988 issue of
Road & Track magazine. Here's why.

My company makes intelligent page readers that take text and
graphics off any printed page, putting the information directly
into computer-processable form. That means you can access it
immediately in any of today’s popular word processing or desktop
publishing programs. Although Calera has primarily supplied this
technology to businesses that process large volumes of paper,
like insurance claims processors, CD-ROM builders, book
publishers, and the government, we are introducing an exciting
new product for personal computer users.

Normally, we ship test documents along with our scanners so that
customers can verify that their machines are working correctly.
As part of the new product, I am building a test deck of typical
business or personal use documents, and I would very much like
to include the above-mentioned article.

It works in a number of ways: it’s humorous, it’s educational,
and it has an interesting layout. In particular, the complex
layout (3-columns with picture, run-around text, etc), is a good
one to show off the capabilities of our product.

I would very much appreciate your reply, indicating the procedure
for obtaining reprint permissions. If you need further details,
please do not hesitate to call. I look forward to working with
you on this.

Laura J. Ness
Publications Manager


Errors with Colored Backgrounds


Your scanner may have trouble adjusting to two different
backgrounds on the same page. It may not be able to read the
darker area at all. Sometimes adjusting the Brightness setting
will help. If this does not solve the problem, you may have to
scan the page twice, changing the Brightness setting for the
second background. If your scanner picks up all the text this
way, you can use Preview mode to eliminate the poor-contrast
portion of each scanned image.


Keep in mind that some scanners read certain colors as black. If
the background color is seen as black by your scanner,
WordScan drops that section of text entirely.
Noise


If it is impossible to remove the source of the noise from your
original document, it is normally best to remove the effects of
the noise by using the Verifier, so that the document structure
analysis that occurs when you save your document does not
have to deal with additional confusion. Certain elements will
cause somewhat predictable problems:


Staple holes        Might be recognized as periods or commas.

Sideways text       Might be recognized as noise. Use Preview
                    mode to eliminate sideways text (and read the
                    page again to recognize it in the proper
                    orientation, if necessary).

Margin notes        Might confuse decolumnization and/or WordScan
                    might treat them as an image, as it does with
                    signatures.

Three-ring binder   Cause various results, usually extraneous
holes               characters.

Paper clips         WordScan reads them as random characters or
                    images.

Coffee stains       Depending on how dark they are, WordScan
                    might read them as images or noise.


Such noise problems may cause layout problems if they are not 
removed prior to saving the finished document. If using the 
Verifier is not an acceptable solution, try using Preview mode 
to eliminate the problem areas of the page. In all of the above 
cases, using a zone will greatly improve the chances of good 
recognition.


Correcting Layout Problems


Most often when you are scanning a document that presents
text in columns (such as a magazine or newspaper page), you
will want to capture the information in continuous paragraphs
without the columns. The process that accomplishes this for
you is called decolumnization, and is required by most word
processors. Decolumnization formats multiple columns in
normal reading order for use in your word processor or other
program.


WordScan can automatically decolumnize documents with up 
to five columns across; you only have to scan the page once. 


[Figure: Decolumnization translates a multi-column page into a
single column of text. Callouts: Pages with multiple columns are
usually decolumnized from left to right and from top to bottom.
Centered text may appear flush left. Blank lines separate what
were columns in the original document. Other details such as
hyphens and where line breaks appear depend on your word
processor or desktop publishing program.]


WordScan applies a high degree of intelligence to 
decolumnization—if WordScan sees that the columns are part 
of a table, it doesn’t decolumnize them. 


WordScan may misread the order of the columns of text in your 
document. In rare instances where this happens, you can easily 
reorder the text in your application program after saving. 
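

The reading order that decolumnization produces (columns from
left to right, each column from top to bottom, with blank lines
between what were columns) can be sketched in Python as follows.
This is a simplified illustration, not WordScan's algorithm: the
block tuples, the decolumnize function, and the column_gap
tolerance are assumptions, and the sketch makes no attempt at the
table detection described above.

```python
def decolumnize(blocks, column_gap=50):
    """Return block text in reading order.

    blocks is a list of (left, top, text) tuples in page units.
    Blocks whose left edge lies within column_gap of a column's
    first block are treated as belonging to that column.
    """
    columns = []
    for left, top, text in sorted(blocks, key=lambda b: b[0]):
        if columns and left - columns[-1]["left"] <= column_gap:
            columns[-1]["blocks"].append((top, text))
        else:
            columns.append({"left": left, "blocks": [(top, text)]})

    parts = []
    for column in columns:
        ordered = sorted(column["blocks"], key=lambda b: b[0])
        parts.append("\n".join(text for _, text in ordered))
    # A blank line separates what were columns in the original.
    return "\n\n".join(parts)
```

If the result comes out in the wrong order for an unusual layout,
the fix is the same as described above: reorder the text in your
application program.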
If you set the Text Format to ASCII, WordScan will save the
text exactly as it appears on the page without decolumnizing it.
You then can use your word processor or other program to
change the layout of blocks of text. There may be times when
moving text yourself is the simplest way to get the layout you
want.


If you want ASCII text output but want decolumnization to
occur, choose Decolumnized ASCII as your text format. This
will provide the benefits of decolumnization without further
word processor formatting produced by the other text formats.


Getting the Best Image Scans


The detail and accuracy of scanned images is determined by
the capabilities of your scanner and the quality of your original
documents. There are two kinds of images:


- Line art: black lines on a light background, with no
  intermediate shades of gray. All scanners can produce a
  binary scan of line art which you can then import into your
  graphics application.


- Halftones: photographs and other graphics that have
  shades of gray or color. Using a dithering pattern
  (available in most scanners) simulates the shades of gray or
  color. If dithering is available in the scanner, you can
  select the method of dithering under Scanner Settings
  in the Options menu before scanning the page. However,
  be aware that WordScan won’t let you process text in
  either Automatic or Preview mode with Image Type set
  to Dither.
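

To see how a dithering pattern can simulate shades of gray with
only black and white dots, consider the following Python sketch
of ordered dithering. It uses the standard 4 x 4 Bayer threshold
matrix, which is an assumption chosen for illustration. The
patterns built into a particular scanner are not documented here
and may differ.

```python
# Standard 4 x 4 Bayer matrix (values 0..15).
BAYER_4X4 = [
    [0, 8, 2, 10],
    [12, 4, 14, 6],
    [3, 11, 1, 9],
    [15, 7, 13, 5],
]


def ordered_dither(gray):
    """Convert rows of gray values (0 = black, 255 = white) to dots.

    Returns rows of 0 and 1, where 1 means a black dot. Each pixel is
    compared with a threshold that repeats every 4 pixels in both
    directions, so darker areas receive more black dots.
    """
    dots = []
    for y, row in enumerate(gray):
        dot_row = []
        for x, value in enumerate(row):
            threshold = (BAYER_4X4[y % 4][x % 4] + 0.5) * 16
            dot_row.append(1 if value < threshold else 0)
        dots.append(dot_row)
    return dots
```

A uniform mid-gray area comes out with half of its dots black,
which the eye blends back into gray. The same scattering of dots
breaks up character shapes, which is one reason a dithered image
is a poor candidate for text recognition.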


Appendix A
Factory Default Settings


The following screens show the factory default settings for
WordScan.


[Screen: Setup Window - Untitled (0 pages)]

Reading for:          Text Only
Page Orientation:     Portrait
Image Source:         Scanner

(The Type Quality setting and the Start Processing buttons,
Automatic and Preview, also appear in this window.)


[Screen: Preview Settings]

Preview Resolution:   25% (75 dpi)

(The Preserve Template, Grid Marks, and Automatic Scan options
also appear in this dialog box.)


[Screen: Text Settings]

Format:               ASCII


[Screen: Scanner Settings]

Image Type:           Binary (the Dither choice shows 4x8 Fatting)
Page Size:            Letter (other choices: A4, Legal)
Scan Resolution:      300 d.p.i.
Image Format:         PageMaker (TIFF)

(A Contrast control also appears in this dialog box.)

Appendix B 
Troubleshooting 


Use this appendix for any of the following reasons:


- You couldn’t complete installation because the
  installation program had problems with your system
  configuration.


- You just finished installing WordScan and it does not
  work.


- WordScan was working correctly but suddenly stopped
  working.


- System performance is not acceptable.


Throughout this appendix, the directory C:\CALERA refers to 
the directory in which you installed WordScan. If you installed 
WordScan in a different directory or on a different disk drive, 
substitute the appropriate information when necessary. 


The First Step


Before you do anything else, run the Technical Assistant
program. The Technical Assistant can instantly find many
problems that prevent proper installation or operation of
WordScan. If the installation program fails, or if WordScan
won't launch properly, you’ll be instructed on the screen how
to start the Technical Assistant.


- If you want to run the Technical Assistant from the DOS
  prompt, type:


  c:\calera\bin\checkout <Enter>


- If you want to run the Technical Assistant from within
  WordScan, choose Technical Assistant from the Help
  menu.


Read the instructions and advice on the screen to determine the 
cause of your problems. If you can’t get far enough to run the 
Technical Assistant, then follow the Basic Installation Checklist 
below. 
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Basic Use this checklist to find and remedy the most common 
Installation installation problems. 
Checklist 


I Make sure your PC starts up normally (without any error 
messages) and can run other software. 


Make sure the scanner is plugged in, turned on, and that 
the cable is securely fastened to both the scanner and the 
interface card in the PC. Cable connections are a very 


common trouble spot. 


L 


LI Make sure the scanner is plugged in, turned on, and that 
the scanner is working according to the manufacturer's 
specifications. If you haven’t done that, it is important to 
do so now, before continuing with troubleshooting. If 
you have another software package that uses this scan- 
ner (such as the software supplied with the scanner), use 
it to see whether the problem is specific to WordScan or 
common to other programs. 


(J) Make sure Windows version 3.0 is installed and working 
correctly. Try running other Windows software such as 
Paintbrush, Notepad, or Calculator to see if they run 
properly. The diagnostic program will check for the 
proper version of Windows and report what it finds. 


O Make sure that your system has enough memory. You 
need a minimum of 2 megabytes for DOS, Windows, and 
WordScan—more if you want to load other programs at 


the same time. 


□ Be sure to reboot your computer after installing Windows
so that the Windows subdirectory is added to your
path. If the WordScan installation program cannot find
your Windows subdirectory in the path, the installation
program will terminate. For important information on
systems with both Windows 2.x and Windows 3.0, see page
113.


□ From the Windows Program Manager, choose About
Program Manager from the Help menu to determine
the mode in which Windows is operating and the
amount of memory available. Windows should be running
in either Standard or Enhanced mode (preferably
Standard mode, which can result in up to 200% faster
performance) and should report at least 1000 Kbytes of free
memory. To start WordScan with Windows in Standard
mode, type win /s wordscan. Consult the installation
and configuration information included with Windows to
determine how to correct any problems.


□ If you are using a disk cache program (other than the
SMARTDRV.SYS program included with Windows),
temporarily disable it and try to install again. Some cache
programs interfere with proper memory allocation. Once
you complete the installation, enable the cache program
and make sure it works. You should never run Windows
without a disk cache program; performance will not be
acceptable. In most cases, you'll want to use
SMARTDRV.SYS.
If WordScan Has Stopped Working

If WordScan worked at one time, but suddenly stopped
working, you may have inadvertently altered your system
configuration. First, try running the Technical Assistant program
as described on page 108. Then review the following questions
to help pinpoint the problem. If this does not help, try
reinstalling WordScan before calling for help.


□ Is the problem with your scanner, your scanner interface
card, or WordScan? To find out, try using WordScan to
process the disk file WARRANTY.PCX. This sample file
can be found in the C:\CALERA\DIAG directory. It is a
scanned image of a page of text, included so that you
can isolate WordScan problems from scanner problems.
If WordScan can successfully recognize WARRANTY.PCX
(recognition probably won't be perfect), it is very likely
that something is wrong with your scanner or scanner
interface card. If WordScan won't recognize the file, then
proceed with the following questions.


□ Have you installed cards in your computer that might
interfere with the memory addresses being used for your
scanner interface card? If so, you may need to reconfigure
the new cards to use other memory addresses,
DMA channels, or interrupts.


□ Is your scanner plugged in, turned on, and securely
connected to your interface card? If it is not, errors can
occur when launching or running WordScan. Can you run
the scanner with any other software package?


□ Have the system configuration files (AUTOEXEC.BAT
and CONFIG.SYS) been modified recently? Installing a
new application program may have modified these files
in such a way that they will no longer work with
WordScan. (To determine the last time they were modified,
you can check the date and time displayed by the DOS
DIR command.)


□ Is your path too long? The DOS path statement can be
no more than 128 characters long, including the “PATH=”.
If you installed a new program that modified the path,
your path may now have too many characters, preventing the
necessary directories from being included in the system's
path. To determine the actual path in use, type set
<Enter> and look at the PATH line. Modify the path
statement in your AUTOEXEC.BAT file if necessary.


□ Did you recently install a new Windows application
program? If so, it may have changed the EMMSize parameter
in the SYSTEM.INI file. For proper operation, the
EMMSize parameter should be left blank:


EMMSize =


□ If you elected not to have the WordScan installation
program modify your AUTOEXEC.BAT file, it created a
batch file named wordinit.bat. Did you remember
to run this batch file prior to launching WordScan?


□ Have you installed memory-resident application
programs that take up too much memory or conflict with
Windows operation? Or, have you loaded other Windows
programs that are leaving too little memory free for
proper operation of WordScan? Choose About Program
Manager from the Windows Program Manager Help
menu to check the amount of free memory.


□ Is Windows running in the proper mode? Windows
should be running in either Standard mode (preferable)
or Enhanced mode (acceptable). It should not be running
in Real mode. Again, choose About Program
Manager from the Help menu to determine which mode
Windows is in.


□ Certain programs load their own memory-resident
programs that can remain in memory when you exit from
the program. In particular, the PC Paintbrush line of
products from ZSoft Corporation loads a memory-resident
screen capture program called FRIEZE when started.
During PC Paintbrush installation, you can specify that
you want to remove FRIEZE from memory when exiting
from the program, or you can remove FRIEZE manually
by switching to the PC Paintbrush directory and typing
frieze. WordScan will not operate with FRIEZE in
memory. ZSoft products that use FRIEZE include PC
Paintbrush, PC Paintbrush Plus, and Publisher's
Paintbrush. Watch out for other programs that use
memory-resident programs.


□ Are the WordScan program files or the disk that they are
on corrupted? Run the DOS CHKDSK program to find
out.


□ Have you run out of disk space for saving your documents?
If you've been scanning image files or using the
deferred processing feature, this can happen very quickly.


□ Did you reinstall Windows? WordScan must be installed
after installing Windows because the WordScan installation
program modifies the WIN.INI and SYSTEM.INI files
in your Windows directory. If you reinstall Windows,
you must either save these files first or reinstall
WordScan afterward.
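

Two of the checks in the list above, the length of the path
statement and the EMMSize setting, can also be verified
mechanically. The following Python sketch is an illustration only
and is not part of WordScan. The function names are hypothetical,
the 128-character limit is the figure quoted in this checklist,
and the SYSTEM.INI contents are passed in as plain text rather
than read from any particular directory.

```python
from typing import Optional

# Limit quoted in the checklist above; it counts the "PATH=" itself.
MAX_PATH_STATEMENT = 128


def path_statement_length(path_value: str) -> int:
    """Length of the whole statement, including the leading "PATH="."""
    return len("PATH=" + path_value)


def path_fits(path_value: str) -> bool:
    """True if the statement stays within the quoted limit."""
    return path_statement_length(path_value) <= MAX_PATH_STATEMENT


def emmsize_value(system_ini_text: str) -> Optional[str]:
    """Return the text after "EMMSize=", or None if no such line exists.

    An empty string means the parameter is blank, which is what the
    checklist recommends. Section headers are ignored (a simplification).
    """
    for line in system_ini_text.splitlines():
        key, sep, value = line.partition("=")
        if sep and key.strip().lower() == "emmsize":
            return value.strip()
    return None


if __name__ == "__main__":
    # Hypothetical path value, for illustration only.
    sample_path = r"C:\WINDOWS;C:\DOS;C:\CALERA\BIN"
    print(path_statement_length(sample_path), path_fits(sample_path))
    print(repr(emmsize_value("EMMSize =\n")))
```

A result of None from emmsize_value means only that no EMMSize
line was found in the text supplied; it does not indicate a
problem by itself.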
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Windows 3.0 with 
Windows 2.x 


If you need to have both Windows version 3.0 and an earlier
version of Windows on your computer, you may encounter a
problem with confusion in your path statement. If the path to
an earlier version of Windows comes before the path to
Windows 3.0, the installation program may not complete
successfully. Also, with both versions of Windows on your disk,
you may encounter other problems, such as one version of
Windows loading the other version's programs.


To avoid conflicts such as these, you should put only the path
to Windows 3.0 in your AUTOEXEC.BAT file, then reboot and
install WordScan. When you want to run the other version of
Windows, switch to its directory first, then start Windows. If this
is not a satisfactory solution, create two batch files, one for
each version of Windows, that set the path and then start
that version.


If WordScan Performance Is Unacceptable

There are many different areas that can affect the performance
of WordScan and other programs. Check the following areas:


□ Check for a computer virus. Computer viruses corrupt
your system when you install software from infected
diskettes or download infected programs by modem.
They can even get into your system over a network
connection. A virus can not only slow down your system;
it can also damage your programs and data. You should
regularly check your system for viruses by using a
program such as VIRUSCAN from McAfee Associates. This
program is distributed as shareware on many computer
bulletin boards and information services. Similar
programs should be available from your dealer.


□ Make sure there is sufficient disk space for the temporary
files that Windows and WordScan must create. For best
performance, your disk should have at least 25% of its
space free. If it has less space than this, files may be
fragmented (that is, written in small blocks in many different
places on your disk). This results in very slow
performance.


□ “Optimize” your disk regularly. As you (and your
programs) create, modify, and delete files, DOS uses
whatever space is available on your disk. When you start
with a new hard disk, all the data in one file is stored in
contiguous blocks from beginning to end. However, as
soon as you erase, enlarge, or move a file from one place
to another, the disk becomes “fragmented.” This means
that small portions of files are stored in blocks all over
the disk. Instead of the disk quickly reading the entire
file in one operation, it must search for all the individual
blocks. An optimization program reorganizes your disk
and puts all blocks of each file into a contiguous space.
Programs that do this are available from your computer
dealer and should be run at regular intervals, perhaps
once per week.


□ If performance (or accuracy) is not acceptable, adjust the
brightness setting and try again. With some scanners,
depending on the quality of your original document, the
wrong setting can result in very slow recognition.


□ Use the Dot Matrix setting when appropriate. Recognizing
draft-quality dot-matrix with the normal setting
results in very slow recognition and poor accuracy.


□ Do not attempt to recognize fax image files received in
“standard” mode. A standard fax has a resolution of 100 x
200 dpi. Images for WordScan must have the same number
of dots-per-inch in both horizontal and vertical
directions. The minimum acceptable fax image is one
received in “fine” mode, which is 200 x 200 dpi.


□ Recognition performance varies with the quality, point
size, font, and density of the original document. A page
of 8-point type with 6,000 characters will take much
longer to recognize than a page of 12-point type with
2,000 characters. Also, pages with complex formats
require more recognition time than simple, single-column
pages.
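

The disk space and fax resolution guidelines in the list above
reduce to two simple comparisons. The Python sketch below is an
illustration under stated assumptions and is not part of
WordScan: the function names are hypothetical, the 25% and
200 dpi thresholds are the figures quoted in this section, and
the free-space figure is read from whatever drive holds the
current directory.

```python
import shutil

MIN_FREE_FRACTION = 0.25  # "at least 25% of its space free"
MIN_FAX_DPI = 200         # "fine" mode is 200 x 200 dpi


def free_fraction(total_bytes: int, free_bytes: int) -> float:
    """Fraction of the disk that is free, from 0.0 to 1.0."""
    if total_bytes <= 0:
        raise ValueError("total_bytes must be positive")
    return free_bytes / total_bytes


def has_enough_free_space(total_bytes: int, free_bytes: int) -> bool:
    """True if the disk meets the 25% free-space guideline."""
    return free_fraction(total_bytes, free_bytes) >= MIN_FREE_FRACTION


def fax_image_usable(x_dpi: int, y_dpi: int) -> bool:
    """True if the resolution is the same in both directions and
    at least that of "fine" mode."""
    return x_dpi == y_dpi and x_dpi >= MIN_FAX_DPI


if __name__ == "__main__":
    usage = shutil.disk_usage(".")   # drive holding the current directory
    print(has_enough_free_space(usage.total, usage.free))
    print(fax_image_usable(100, 200))  # "standard" mode fax
    print(fax_image_usable(200, 200))  # "fine" mode fax
```

The free-space check says nothing about fragmentation itself; it
only reports whether the disk is above the recommended margin.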


For More Help


Don't give up! 


Please try the suggestions in this appendix first, especially the 
one that advises you to run the Technical Assistant program. 
Then work with your dealer to try to resolve the problem. Your 
dealer is closer to you geographically, and more likely to be 
familiar with your configuration. Your dealer can also perform 
hardware repair or replacement, both in and out of warranty. 


If you need additional assistance, please have the following 
information ready before contacting Calera Recognition 
Systems: 


□ Computer manufacturer, type, and model


□ WordScan version number and serial number


□ Scanner manufacturer and model number


□ Problem description


Appendix C
Using Hand-Held Scanners
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Inexpensive hand-held scanners are popular for image 
scanning and can easily be used with WordScan. While 
WordScan cannot scan images directly from a hand-held 
scanner, it can process images saved in one of its supported 
image formats (PCX or TIFF). 


One problem associated with hand-held scanners is getting a 
straight and evenly scanned image. WordScan requires an 
image that is angled no more than 2°. Making smooth, straight 
scans requires practice. You might consider using a 
straightedge or ruler as a guide to keep the hand-held scanner 
straight. 
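

To get a feel for how little drift a 2° limit allows, the angle
can be worked out from how far a line of text wanders over the
width of the scan. The following Python sketch is illustrative
only: the function names are hypothetical, and the 4-inch scan
width used in the example is an assumption, not a figure from
this manual.

```python
import math

MAX_SKEW_DEGREES = 2.0  # limit stated in this appendix


def skew_degrees(run: float, drift: float) -> float:
    """Angle of a text line that drifts `drift` units over `run` units."""
    return math.degrees(math.atan2(abs(drift), run))


def max_drift(run: float, limit_degrees: float = MAX_SKEW_DEGREES) -> float:
    """Largest drift over `run` units that stays within the limit."""
    return run * math.tan(math.radians(limit_degrees))


if __name__ == "__main__":
    width_inches = 4.0  # assumed scan width, for illustration
    print(round(max_drift(width_inches), 2))
    print(round(skew_degrees(width_inches, 0.25), 1))
```

Over a 4-inch width, the limit works out to a drift of about
0.14 inch, which is why a straightedge is worth using.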


To use a hand-held scanner: 


1. Scan an area of text and use the software included with the 
scanner to save the image in either TIFF or PCX format. 


2. In WordScan, set the Image Source to Disk. 


3. Click Automatic to begin processing. Or, if desired, click 
Preview so that you can see the scanned images before 
you process them. Specify which image(s) you want to 
process. 


4. Look at each image and make sure it is straight and in the 
proper orientation. Also look for extraneous material and 
use a zone, if necessary, to capture only the portion of the 
image you want. 


5. Click Process One to process the page and load the next 
image. 


Appendix D
ScanIn! Word for Windows Macro


WordScan includes ScanIn!, a macro that lets you scan and
recognize documents directly from within Microsoft Word for
Windows. Installing ScanIn! adds a “Scan/OCR” command to
the Word for Windows File menu, letting you scan and
recognize documents without leaving Word for Windows, and
have them inserted into your word processing document at the
cursor position.


Installing ScanIn!

When you install WordScan, the installation program copies a
special program called MINSTALL.EXE to your hard disk. This
program installs ScanIn! The WordScan installation program
then checks to see if you have already installed Word for
Windows. If so, it asks you if you want to install ScanIn! Enter
y, and then proceed to Step 3.


If you have not yet installed Word for Windows, you won't be
asked if you want to install ScanIn! In this case, follow these
steps:


1. If you haven't already done so, install Microsoft Word for 
Windows. It must be properly installed on your hard disk 
prior to installing ScanIn! 


2. Assuming both the directories for Word for Windows 
(\WINWORD, by default) and WordScan (\CALERA\BIN, 
by default) are in your path, from the DOS prompt type: 


MINSTALL <Enter> 


(If Word for Windows is not in your path, you must first 
change to its directory and then type MINSTALL.) 


The MINSTALL program checks for Word for Windows and,
if it is found, uncompresses a file called CALERA.DOC into
the WINWORD\LIBRARY directory. It then launches Windows
and Word for Windows and opens CALERA.DOC.


When it finishes, you’ll then see the following on your 
screen: 


[Screen illustration: Word for Windows displaying
LIBRARY\CALERA.DOC, the ScanIn! installation document
("WordScan/Word for Windows Scanning and Recognition Macro").
The document contains an INSTALL button and on-screen
instructions for running the installation macro and saving
the changes.]
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Your screen may look different depending on your monitor. 


Important: If the INSTALL button on your screen looks
like this:

[Illustration not reproduced]

open the View menu and make sure Field Codes is not
checked. Also, open the Preferences dialog and make
sure Show All is not checked. The ScanIn! installation
program will not execute if either of these conditions exists.


3. Double-click the INSTALL button. You'll then see the
following dialog box:


[Dialog box illustration: Macro to install]


4. Click ScanIn! to select it, then click OK. Wait while the
ScanIn! installation program executes. You'll see the screen
change several times. When finished, the starting screen
will return. (Note that you can scroll down to see additional
text on the installation screen.)


5. The ScanIn! macro has now been merged into the file of
Word for Windows macros called NORMAL.DOT. Choose
Save All from the File menu to save these changes. You'll
see the following prompt:


Save global glossary and command changes?

Click Yes. You may then see the following prompt:

Save large clipboard?

Click No.


You have now completed ScanIn! installation.


Using ScanIn!

To use ScanIn!, first be sure WordScan is not running, then
launch Word for Windows and follow these steps:


1. Place the text cursor where you want to insert the
recognized text.


2. Choose Scan/OCR... from the File menu. The following
dialog box appears:


[Illustrations: the Word for Windows File menu with the added
Scan/OCR... command, and the resulting dialog box with Scan
From (Scanner, File), Options (Preview, Verify), and Settings
(Dot-Matrix, Landscape) groups and OK and Cancel buttons]


3. Choose the desired options:


Scan From

   Scanner      Scan a document and perform recognition.

   File         Perform recognition on one or more image or
                fax files.

Options

   Preview      Check to use the Preview window (described in
                Chapter 5). Click to remove the X to use
                Automatic mode (just as when you click
                Automatic from the WordScan Setup window).

   Verify       Check to use the Verifier (described in
                Chapter 6) after recognition is complete and
                before inserting text into your document.
                Click to remove the X if you don't want to use
                the Verifier. In this case, text will be
                inserted into your document when recognition
                is complete.

Settings

   Dot-Matrix   Check to enable Dot-Matrix recognition mode
                (described in Chapter 4). Click to remove the
                X for Normal recognition mode.

   Landscape    Check to enable recognition of landscape
                pages. Click to remove the X for recognition
                of portrait pages.


4. Place your document in your scanner’s document feeder (if 
applicable), then click OK to proceed. 


5. Wait while WordScan scans and recognizes your document
according to the settings you made in Step 3. Use the
Preview window and Verifier (if applicable) just as you
would when you use WordScan. When you click Done or
Stop Scanning, or when you exit from the Verifier, the
recognized text is inserted into your Word for Windows
document. Any images you capture (by using the Preview
window) are saved in their own files (see below).


ScanIn! and Images

You can capture images from your document by using
WordScan's Preview mode. If you draw image zones in the
Preview window, WordScan saves your images in TIFF
Packbits format. (You can change this setting if desired by
choosing Scanner Settings from the Options menu of the
Preview window; however, other settings may be incompatible
with Word for Windows.)


To place images you capture into your Word for Windows 
document, use Word for Windows’ Insert Picture command. 
From the Insert Picture dialog, find your images in the 
following directory: 


\WORDTEMP\W4WMACRO 


(If you specified a different directory name in place of 
WORDTEMP during WordScan installation, select that directory 
name in place of WORDTEMP.) 


The images are named CIMG-nnx.TIF, where nn is the page 
from which you captured the image and x is a letter indicating
the sequence of multiple images from the same page. For 
example, the third image on the second page you scan would 
be named CIMG-02C.TIF. 
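

The naming rule can be expressed compactly. The Python sketch
below is illustrative only and does not reflect how ScanIn!
itself is implemented: the function name is hypothetical, and
the limits of 99 pages and 26 images per page are assumptions
inferred from the two-digit page field and the single sequence
letter.

```python
import string


def image_file_name(page: int, sequence: int) -> str:
    """Build a name of the form CIMG-nnx.TIF.

    page     -- page number the image was captured from (assumed 1-99)
    sequence -- position of the image on that page (assumed 1-26)
    """
    if not 1 <= page <= 99:
        raise ValueError("page must be between 1 and 99")
    if not 1 <= sequence <= 26:
        raise ValueError("sequence must be between 1 and 26")
    letter = string.ascii_uppercase[sequence - 1]
    return f"CIMG-{page:02d}{letter}.TIF"


if __name__ == "__main__":
    # Third image on the second page, as in the example above.
    print(image_file_name(2, 3))
```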


Note: ScanIn! always uses the same name (CIMG) for the
image files it creates; therefore, when you process
subsequent documents that include images, you'll be warned
that image files already exist and be given the opportunity
to overwrite the existing images, append the new images
(that is, renumber them beginning with the next page
number in sequence), or cancel the operation (and not save the
new images). If you choose to overwrite, all images in the
W4WMACRO directory are deleted, regardless of the
number of new images to be saved.


[Dialog box illustration: "Images already exist. Do you want
to:" with three options: Replace previously saved images?
Append to the image list? Discard images just defined?]


Glossary 


autoexec.bat A special file in your computer's root (top level) directory that causes 
the computer to execute certain programs whenever it is turned on or 
reset. AUTOEXEC.BAT is an ASCII text file and can be edited by text 
editing programs such as the Microsoft Windows Notepad. 


binary image A graphic image made up of on-and-off (white and black) conditions 
only. Color and shades of gray cannot be represented in a binary 
image; however, shades of gray can be simulated by a process called 
dithering. Recognition only works with binary images. 


bit map A type of graphic image stored in your computer as a series of 
on-and-off conditions. Bit maps can be binary images, or can include 
shades of gray or color. Bit maps are produced by “paint” programs, 
such as PC Paintbrush, Windows Paint, and Gem Paint. Another type 
of image is a vector image. Vector images are stored in your computer
as mathematical equations representing lines and arcs, and are 
converted to bit maps upon printing (either by your computer or by 
your printer). Vector images are produced by “draw” programs such 
as Gem Draw, Adobe Illustrator, Corel Draw, and others. Images 
captured by scanners are stored as bit maps. 


character recognition The capability of an electronic device to convert graphic images of
characters into computer-usable form; generally into their ASCII or
ANSI equivalents.


check box In Microsoft Windows applications (such as WordScan), a small box 
on the screen that can have one of two conditions: checked (enabled) 
or not checked (disabled). See also: option button 


click The action of pressing the mouse button once, then releasing it.


complete document recognition The ability to perform recognition on documents, retaining as much
information as possible about the features and formatting of the
original, and including the ability to capture images as well as text at
the same time.


config.sys A special file in your computer's root (top level) directory that causes 
the computer to load driver programs. Driver programs are used by 
your computer to perform operations beyond the “normal” abilities of 
the operating system, such as enabling communication with a 
scanner. When you turn on or reset your computer, the CONFIG.SYS 
file executes first, followed by the AUTOEXEC.BAT file. 


decolumnization The process of reformatting multi-column documents into a single 
column. Generally, when you are processing a document for use in a
word processing program, a single column of text is preferable to 
multiple columns. WordScan performs decolumnization on 
documents while saving them. Exception: If you select ASCII text 
format, no decolumnization is performed. 


deferred processing WordScan's ability to quickly scan your documents, then perform
recognition later.


dialog box In Microsoft Windows programs (such as WordScan), a window that
appears in response to some action, requesting additional information.


dither A method of representing shades of gray by using various
combinations of black and white dots in a binary image. Dithering is
similar to a halftone screening process; however, dithering results in
lower effective image resolution, as clusters of dots must act as a unit
to represent a particular shade of gray.


document recognition The ability to capture all the information on a page (text and images)
and perform not only character recognition, but page structure
analysis as well.


document structure analysis (DSA) A process to which a document is subjected when you save it as a
finished document. DSA uses information about the position of
various elements and applies a high degree of intelligence to properly
format the resulting document.


double-click The action of pressing the mouse button twice in rapid succession,
generally to make a selection and initiate an operation in one step.


drag The action of pressing and holding down the mouse button while
moving the mouse, generally causing a change in the position or size
of a selected object on the screen.


driver A program that provides an interface between the computer and an
application program. For example, a scanner driver is a program that
lets WordScan communicate with your scanner. Driver programs
generally are kept separate from the application program so that the
application program can work with a variety of devices just by
changing driver programs.


drop-down list box A Microsoft Windows feature that displays a selected item. Clicking
on the downward pointing arrow opens a list box that drops down
from the selected item and enables you to select from among a list of
items.


font A set of characters that is consistent in style, size, and weight.


graphic image Line art, drawings, photographs, and so forth. In WordScan, any
element of a page that is captured as an image, regardless of what it
is. Recognition is not performed on images. They are merely captured
and converted to the specified image type.


gray scale An image stored with multiple bits per pixel that can then be
displayed or printed in shades of gray. If four bits are stored for each
pixel, sixteen different shades of gray can be represented and
reproduced. Gray scale images require large amounts of storage space
and cannot be used for recognition.


halftone An image that has been converted into a pattern of dots so as to be
printable on a binary device (such as a printing press). In printing,
continuous tone images such as photographs are photographed
through a fine mesh. This breaks the image into a pattern of dots that
represent the original shades of gray. Once an image is halftoned, it
can often be scanned as a binary image.


icon A visual representation of an object. In Microsoft Windows, each
application program is represented by an icon.


image Line art, photographs, drawings, and so forth. In WordScan terms, any
area of the page that is specified as image, regardless of its actual
content. Images are not recognized. They are merely captured and
converted to the image format you specified.


image format The file format that WordScan uses when it captures and saves
images. Image formats include ZSoft's PCX format, Tagged Image File
Format (TIFF) and Calera's Processed Document Architecture (PDA)
format.


insert mode The Verifier has two typing modes, insert mode and overtype
mode. Insert mode moves characters to the right as you type. See
also: overtype mode


line art An image composed of lines. Usually line art is black and white, and
never contains continuous tones. See also: gray scale


list box In Microsoft Windows applications (including WordScan), a box that
contains a list of items. When the list contains more items than will fit
in the box, a scroll bar facilitates moving to different parts of the list.
See also: drop-down list box


marker A character or code that denotes an uncertain character or word.


menu bar In Microsoft Windows applications (including WordScan), a menu bar
appears near the top of a window from which you can pull down
menus and make selections. WordScan has menu bars in the Setup
window and in the Verifier window.


noise In recognition terms, any non-textual data on the original page (or on
the scanner) that contributes to misrecognition. Includes coffee stains,
staple holes, handwritten notations, torn edges, and otherwise dirty
pages.


OCR Optical character recognition. WordScan is in the category of OCR
systems; however, WordScan performs many more tasks as well,
including complete document recognition.


omnifont character recognition The ability to recognize all fonts. In practical terms, the ability to
recognize virtually any font, any typeface, as well as any font style.
WordScan is an omnifont system.


overtype mode The Verifier has two typing modes, insert mode and overtype
mode. Overtype mode replaces characters as you type. See also:
insert mode


pixel A word derived from “picture element.” The smallest unit on a display
screen or printer (or in an image file) that can be individually
controlled. Display screens generally have 50 to 100 pixels per inch,
while laser printers can have 300 or more pixels per inch.


Preview window A WordScan feature that lets you define a text or image zone prior to
performing recognition on a page.


resolution The capability of a scanner, printer, or other device to resolve
information, expressed in dots-per-inch. WordScan is designed to
give optimum results with 300 dot-per-inch images.


scanner A device for capturing graphic images of pages.


scanning The process of converting a printed page into digital form. Scanning
does not mean recognition.


scroll bar A device that appears at the right or bottom edge of a list box or
window to facilitate viewing different areas.


Settings file A file that contains all of WordScan's settings, including Reading for,
Page Orientation, Type Quality, Image Source, Brightness, selected
user dictionary, and optionally, the active template.


Setup window The control center of WordScan where you can change settings and
invoke other operations, such as Automatic or Preview mode.


template An electronic mask that specifies a zone of text or image that you
want to recognize or capture. Templates are used to selectively
include or exclude portions of pages, especially forms.


text format The file format that WordScan uses when it saves a finished
document. Text formats are available to match most popular word
processing and spreadsheet programs.


user dictionary A list of words that is used during recognition in conjunction with the
built-in recognition dictionary.


zone A rectangular boundary that surrounds an area of text or image to be
recognized or captured. Zones can be defined in Preview mode.


zone indicator A label in the Verifier that indicates the beginning of a new text or
image zone. If you define multiple zones on a single page, a zone
indicator will appear at the beginning of each zone when you proof
your document.
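

The gray scale, pixel, and resolution entries above involve some
simple arithmetic, which the following Python sketch works
through. It is an illustration only: the function names are
hypothetical, the 8.5 x 11 inch page size is an assumption, and
the storage figures are for uncompressed image data without any
file header.

```python
def gray_levels(bits_per_pixel: int) -> int:
    """Number of distinct shades that fit in the given number of bits."""
    return 2 ** bits_per_pixel


def page_pixels(width_in: float, height_in: float, dpi: int) -> tuple:
    """Pixel dimensions of a page scanned at `dpi` dots per inch."""
    return round(width_in * dpi), round(height_in * dpi)


def image_bytes(width_in: float, height_in: float, dpi: int,
                bits_per_pixel: int = 1) -> int:
    """Uncompressed size in bytes (1 bit per pixel is a binary image)."""
    width_px, height_px = page_pixels(width_in, height_in, dpi)
    total_bits = width_px * height_px * bits_per_pixel
    return (total_bits + 7) // 8  # round up to a whole byte


if __name__ == "__main__":
    print(gray_levels(4))                  # four bits per pixel
    print(page_pixels(8.5, 11, 300))       # assumed letter-size page
    print(image_bytes(8.5, 11, 300, 1))    # binary image
    print(image_bytes(8.5, 11, 300, 4))    # 16-level gray scale
```

The last two figures show why gray scale images need so much
more storage than binary images of the same page.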
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A 
adding pages 33 
adding words to a user dictionary 40 
automatic processing 13 
Automatic Scan setting 59 


B 
basic installation checklist 109 
binary image 37 
binder holes 102 
Bold 75 
brightness control 31
broken lines 100 
button 
Add Pages 33 
Automatic 13, 32 
Done 14, 22 
Preview 19, 52 
Process 83 
Process One 21, 55 
Process Stack 55 
ReScan 20, 56 
Scan 19 
Stop Scanning 14, 22 
Turn Pages Over 33 


C 
Change 74 
Change Next 74 
Change Rest 74 
changing settings 12 
Character menu 
Bold 75 
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Automatic Scan 59 
Delete All Intermediate Files 43 
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Mark Suspicious Characters 35 
Mark Suspicious Words 35 
Preserve Template 58 
Save Document for Proofing 81 
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checking recognition 94 


Clear 72 
Clear Markers 72 
colored backgrounds 101 
commands 
Verifier 63 
Contrast 37 
Control menu 57 
Cut Page 70 


D 
darken 31 
default settings 49 
deferred documents 
processing 83 
deferred processing 80 
defining a template 88 
delete 
settings 48 
template 91 
user dictionary 40 
words from a user dictionary 41 
zone 54 
Delete All Intermediate Files 43 
disk files 84 
disk image source 31 
disk space requirements 4 
dithered image 37 
document naming 44 
dot-matrix type quality 30 
double-sided recognition 34 
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dither type 37 
Image File Type 85 
Image Format 38, 43 
Language 35 
Preview Resolution 60 
Scan Resolution 38 
Text Format 35, 43 


E 
Edit Menu 
Clear 72 
Clear Markers 72 
Undo 72 
Whiteout 72 
editing text 63 


entering special characters 78


Exit Verifier 71


fax files 84
fax image source 31
File menu
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Cut Page 70 
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Exit Verifier 71 
Go to Page 70 


New 45 
Next Page 70 
Open 82, 87 


Paste Page 71 
Retrieve Settings 48 
Retrieve Template 90 
Save 16, 42, 81 
Save Settings 46, 89 
Save Template 88 
Save Unconverted 82, 87 
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Find 73 
Find Next 73 
Fit to Window 57, 65 
flipped orientation 30 
format 
text 16, 26, 43 
Full Scale 57, 65 


G 
Go to Page 70 
Grid Marks 59 


H 
Half Scale 57, 65 
hand scanners 118 
handwritten notations 101 
hard disk requirements 4 
help 10 
Help menu 10, 78 
ABC 20 
Brightness 31 
Disk 84 
Disk Image Source 31 


Dot Matrix Type Quality 30 


Fax 84 
Fax Image Source 31 
Flipscape Orientation 30 
image tool 53 
Landscape Orientation 30 
Normal Type Quality 30 
numeric tool 53 
Portrait Orientation 30 
Reading For Image 29 
Reading For Template 29 
Reading For Text 29 
Reading For Text&Img 29 
Scanner Image Source 31 
selection tool 54 
templates 91 
text tool 20, 53 
text&img tool 53 
DCX 85 
dither 37 
fax types 86 
file type 85 
indicator 64 
PCX 85 
PDA 85 
TIFF 85 
view 78 
Image File Type 85 
Image Format 38, 43 
image scans 104 
image tool 53 
images 
viewing 64 
importing a user dictionary 40 
improving recognition 94 
installation 6 
basic checklist 109 
installing Windows 6 
installing WordScan software 6 
Italic 75 


J 


joined characters 98 


L 
landscape orientation 30 
Language 35 
launching WordScan 12 
layout problems 103 
lighten 31 
loading scanner 13 
Look for Marker 72 


M 

macro 
Word for Windows 120 
ScanIn! 120 


margin notes 102 
Mark Suspicious Characters 35 
Mark Suspicious Words 35 
maximizing 56 
maximizing windows 20 
memory requirements 4 
message 
Cannot delete region marker 69 
No more markers 25, 65, 73 
No more pages, exit? 25, 70 
Save Current Document? 46 
minimum equipment 4 
misplaced capitals 98 
missing text 99 
moving a zone 54 


N 
naming documents 44 
Next Page 70 
noise 102 
non-keyboard characters 78 
nonsense 100 
Normal (text attribute) 75 


number 
recognition 53 
numeric tool 53 


O 
OCR 2 
Open 87 
Options menu 
Defer Processing 80 
Preview Settings 58 
Process Deferred Docs 83 


Remove Duplicate Markers 66, 77 


Scanner Settings 36 
Show Char Markers 76 
Show Text Regions 76 
Show Word Markers 76 
Text Settings 35 
User Dictionary 23, 39 
Verifier 15, 24, 62 
View Image 78 
orientation 14, 30 
overview 2 


P 
page orientation 30 
paper clips 102 
Paste Page 71 
path 
setting 10 
portrait orientation 30 
Preserve Template 58 
preview operation 19 
Preview Resolution 60 
Preview Settings 
Automatic Scan 59 
Grid Marks 58-59 
Preview Resolution 60 
Preview window 52 
maximizing 20 
scaling the image 20 
processing deferred documents 83 
processing pages 
automatic 13 
Progress Monitor 13, 21, 32 
proofing 15, 24, 62 


Q 
Quarter Scale 57, 65 


R 
radio button 

A4 38 

Binary 37 

Dither 37 

Legal 38 
templates 91 
rebooting 10 
recognition 

checking 94 

double-sided 34 

improving 94 
recognizing 

disk files 84 

faxes 84 

numbers only 53 
Remove Duplicate Markers 66, 77 
replacing words in a user dictionary 41 
resizing a zone 54 
Retrieve Template 90 
retrieving settings 48 


S 
Sample document #2 18 
sample page #1 11 
save 
document 42 
settings 46 
Save Markers As 44 
Save Settings 89 
Save Template 88 
Save Unconverted 87 
saving 
finished document 16 
scaling the Preview image 20 
Scan Resolution 38 
ScanIn! macro 120 
scanner 
loading 13 
Scanner image source 31 
scanner installation 6 


scanner settings 

Contrast 37 

Image Format 38 

Image Type 37 

Page Size (to scan) 38 

Scan Resolution 38 
scanning 

hand scanners 118 
Search menu 

Change 74 

Find 73 

Find Next 73 

Look for Marker 72 
selecting a zone 54 
selecting text 67 
selection tool 54 
setting the path 10 
settings 

changing 12 

default 49 

deleting 48 

table of saved settings 48 
Setup window 12, 28 
Show Char Markers 76 
Show Text Regions 76 
Show Word Markers 76 
sideways text 102 
special characters 78 
split characters 98 
stains 102 
staple holes 102 
starting a new document 45 
starting WordScan 12 
suspicious 

characters 63 

words 63 
system requirements 4 


T 
Technical Assistant program 108 
template 

defining 88 

deleting 91 

retrieving 90 

saving 88 


saving with settings 89 
viewing 90 
Template icon 29 
Template menu 60 
text format 16, 26, 35, 43 
text settings 35 
Language 35 
Mark Suspicious Characters 35 
Mark Suspicious Words 35 
Text Format 35 
text tool 20, 53 
text zone 
defining 20 
text&img tool 53 
troubleshooting 108 


U 
uncertain 
character 63 
word 63 
Underline 75 
Undo 72 
user dictionary 23, 39 
add 40 
adding words in Verifier 66 
automatically use 41 
delete 40 
delete (word) 41 
import 40 
naming 24, 39 
new 23, 39 
open 24, 39 
replace 41 
using help 10 


V 
Verifier 15, 24, 62 
command summary 68 
commands 63 
exiting 16 
Help menu 78 
using the mouse 67 
verifying 15, 24, 62 
View Image 78 
viewing a template 90 
viewing images 64 


W 
warranty.pcx file 110 
Whiteout 72 
window 
Preview 52 
Setup 12, 28 
Verifier 62 
Windows 
multiple versions 113 
Word for Windows macro 120 


Z 
zone 53 
deleting 54 
moving 54 
resizing 54 
selecting 54 
Zoom menu 20 
Fit to Window 57, 65 
Full Scale 57, 65 
Half Scale 57, 65 
Quarter Scale 57, 65 


CALERA 


RECOGNITION SYSTEMS 


Tutorial Sample Page #1 


Congratulations on buying a Calera recognition product! You 
have joined most of the Fortune 500 in purchasing the most 
accurate character recognition available today. 


This product will read typewritten, typeset, laser print, and 
even draft-quality dot-matrix documents. 


Our products are used in many different applications. Here 
are some examples: 


1. Word Processing: Scanning reports, memos, contracts and 
even correspondence. 


2. Desktop Publishing: Generating newsletters by scanning 
articles from many different sources, revising presentations, 
updating technical manuals. 


3. Full-Text Databases: Capturing competitive information, 
storing magazine and journal articles, scanning resumes. 


4. Spreadsheets: Consolidating forecasts and budgets, 
analyzing competitive financial statements. 


You may have purchased this product with one application in 
mind, but we think you’ll quickly find other uses for it! Calera 
Recognition Systems provides character recognition products for 
all your needs. 


Tutorial Sample Page #2 


Enter Text Retrieval 
Databases 


Let’s go back a few years to 
the famous Scopes "Monkey 
Trial" of 1925 when dozens of 
law clerks must have waded 
through hundreds of dusty 
volumes so that chapter and 
verse, theory and counter 
theory, could be quoted 
accurately. "Labor intensive" 
hardly begins to describe 
these efforts. 

Now, imagine events a few 
short years from now. 
Scientists are designing a 
self-contained space habitat, 
and must enter thousands of 
research documents into a 
knowledge base of 
unprecedented sophistication. 
Or, researchers of the 21st 
Century need to sift through a 
vast library of medical 
journals for seemingly 
random connections to come 
up with a cure for the 
common cold. 

There is a common thread 
through these events. The 
historical example would 
have benefited from the 
ability to look through masses 
of data for common 
connections where no 
structured database existed. 
And, the achievements of the 
future may not be feasible 
without technologies now 
known as text retrieval 
databases using full-text 
indexing. 

What exactly does a text 
retrieval database do for your 
productivity? In a nutshell, it 
allows you to collect, locate 
and organize information 
with unprecedented speed, 
accuracy and flexibility. The 
applications are unlimited. 


Text Retrieval Databases 
vs. Relational Databases 


A relational database uses 
fields to organize data. Each 
field has one entry usually 
represented by one word. 
Text retrieval databases, on 
the other hand, are not 
restricted to information 
contained in a limited number 
of fields. Rather, they rely on 
key words which give the 
user far greater flexibility. 
Any word in the entire 
document can be a key word 
and you don’t have to decide 
ahead of time. 


Calera’s Day in Court 


Not so long ago, even the 
simplest litigation process 
required hours of research in 
a law library, days of 
laborious keyboard entry into 
a litigation support database, 
and sometimes court delays 
while evidence was 
cross-referenced. 

With OCR and a litigation 
support package, every 
document important to a case 
can be placed in a database 
with speed and accuracy. 

An example of its uses is 
illustrated by the Association 
of Trial Lawyers of America. 
This organization uses Calera 
products for educational 
materials, resource 
documents and research 
information-sharing cases. 
The organization’s first project 
was to produce a 1200 page 
book for its annual 
convention. The association 
estimates that they saved 
$30,000 by eliminating 
contractor costs for typing, 
typesetting and layout. 